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This application is a continuation-in-part of U. S . Patent Application Serial 
Number 08/888,012, filed July 3, 1 997, which is herein incorporated by reference. 

Background of the Invention 

Field of the Invention 

The present invention relates to a novel GABA A receptor € subunit 
(GABRE) and an alternative transcript thereof (ET2). More specifically, isolated 
nucleic acid molecules are provided encoding human GABRE and ET2 receptor 
subunits. ET2 and GABRE polypeptides are also provided, as are vectors, host 
cells and recombinant methods for producing the same. The invention further 
relates to screening methods for identifying agonists and antagonists of ET2 and 
GABRE activities. Also provided are diagnostic methods for detecting aberrant 
GABRE and ET2 expression, as well as therapeutic methods for treating disorders 
involving ET2 and GABRE. 

Related Art 

In mammalian brain, synaptic inhibition of neuronal activity is mediated 
mainly by the action of the neurotransmitter y-aminobutyric acid (GABA) at 
GAB A-gated chloride ion channels. GABA is an amino acid derivative which is 
produced in neurons by the decarboxylation of glutamate. 

GAB A A receptor complexes are multi-subunit members of a ligand-gated 
ion channel gene superfamily . The binding of GABA to these complexes results 
in the opening of channels across the neuronal cell membrane through which 
anions flow down their concentration gradients. Chloride ions are the main anions 
conducted through these channels, and channel opening requires the binding of at 
least two GABA molecules per GABA A receptor complex. Thus, the binding of 
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GABA to GABA A receptor complexes induces an increase in chloride ion 
conductance across the postsynaptic membrane which alters the chloride ion 
gradient resulting in both hyperpolarization of this membrane and inhibition of 
neuronal firing. After interaction with the extracellular regions of the GAB A A 
5 receptor complexes, GABA is actively "pumped" back into the prejunctional 

neurons. 

Neurotransmitter responsive ligand-gated ion channels are known to be 
major targets for psychoactive drugs (Sargeant ? P. ? v4www. Rev. Neurosci. 76:403- 
443 (1993)). The GABA A receptor complex, for example, is the primary site of 
41 10 action for many of the drugs used to treat anxiety and seizure disorders such as the 

benzodiazepines, barbiturates and tranquilizers (e. g. , Valium and Librium). These 
!j: receptors serve as molecular control elements through which the levels of anxiety, 

IJ vigilance, muscle tension, and epileptic activity, as well as other conditions, can 

■ 

s be regulated by drug-induced modulations. 

f< 15 Heterogeneity with respect to subunit types which make up 

y transmitter-gated ion channels has been shown for several multi-subunit receptor 

1 complexes. The alternative forms of these subunits are generally either encoded 

by distinct genes or arise due to alternative mRNA splicing (see generally Alberts, 
B. et al } Molecular Biology of the Cell, 3rd edition, Garland Publishing, Inc. 
20 (1994)). 

The GAB A A receptor complex is believed to assemble into a pentameric 
structure and, to date, at least fourteen mammalian GAB A A receptor subunits have 
been identified (a 1-6, pi -3, yl-3, 5, e). These receptor subunits fall within five 
families on the basis of amino acid homology (Whiting, P.J., et al 3 Int. Rev. 
25 Neurobiol 38:95-138 (1995); Davies, P.A., et al, Nature 555:820-823 (1997)). 

While the majority of functional receptors contain ct/p/y or oc/p/5 subunit 
combinations, additional subunits combinations can form GABA-activated 
chloride channels. 



Variations in subunit combinations can result in different pharmacological 
properties being conferred upon the receptor complex (Davies, P.A. et al, Nature 
555:820-823 (1997); reviewed in Whiting, PJ. etal, Int. Rev. Neurobiol 38:95- 
138 (1995)). Thus, subpopulations of GABA A receptor complexes show differing 
sensitivity to GABA, steroid modulators, physiological regulation, disease 
processes, and pharmacological manipulation by drugs (e.g., benzodiazepines). 
The distributions of mRNAs encoding different GABA A receptor subunit 
polypeptides and their subtypes localized in the brain show significant regional 
variation consistent with pharmacological and biochemical evidence for receptor 
heterogeneity. Further, alterations in brain specific expression of GAB A A receptor 
complex subunit polypeptides has been identified in human brain tissues of 
individuals suffering from alcoholism (Lewohl, J. et al, Brain Res. 751 : 1 02-1 1 2 
(1997)). 

Thus, the properties of the different subpopulations of GAB A A receptor 
complexes are determined, at least in part, by the subunits expressed in the 
particular cell. The GABA A receptor complex e subunit, for example, confers 
several unique pharmacological and biophysical properties when assembled in a 
GAB A A receptor complex containing a and P subunits (Davies, P.A. etal, Nature 
555:820-823 (1 997)). Most notably, this subunit inhibits the ability of anesthetic 
agents to potentiate GAB A-gated chloride currents. 

While much work has been done with GABA A receptor complexes 
localized in the brain, functional GAB A A receptors have been identified in rat heart 
(Matsuyama, S. etal, Am. J. Physiol 2^:1057-1061 (1993); McLemore, G.L. 
et al, Pharmacology 4P:342-350 (1994)). Further, GABA A receptor subunit 
mRNA has also been identified in human pancreatic tissue (Yang, W. et al> FEES 
Lett 346:257-262 (1994)) and, more recently, in tissues of the human 
reproductive system (Hedblom, E. and Kirkness, E., J. Biol Chem. 272:15346- 
15350 (1997)). Currently, little is known about the subunit composition or 



pharmacological characteristics of these extra-central nervous system GABA A 
receptor complexes. 

Summary of the Invention 

The present invention provides isolated nucleic acid molecules comprising 
polynucleotides encoding the ET2 and GABRE polypeptides having the amino 
acid sequences shown in SEQ ID NO:2 and SEQ ID NO:42. The nucleotide 
sequence shown in SEQ ID NO:l was determined by direct sequencing of PCR 
amplified ET2 cDNA, as described in Example 1 . This DNA sequence contains 
an open reading frame encoding a polypeptide of about 242 amino acid residues. 
The amino acid sequence of the ET2 receptor subunit is shown as amino acid 
residues from about 1 to about 242 in SEQ ID NO:2. The nucleotide sequence 
shown in SEQ ID NO:41 was determined as described in Example 6. This DNA 
sequence contains an open reading frame encoding a polypeptide of about 506 
amino acid residues. The amino acid sequence of the GABRE receptor subunit 
is shown as amino acid residues from about -1 8 to about 488 in SEQ ID NO:42. 

The present invention further provides isolated nucleic acid molecules 
comprising a polynucleotide encoding the GABRE receptor subunit polypeptide 
having the amino acid sequence shown in SEQ ID NO:42 or the amino acid 
sequence encoded by the cDNA clone deposited as plasmid DNA on February 25, 
1998 and assigned ATCC Deposit Number . 

Thus, one aspect of the invention provides an isolated nucleic acid 
molecule comprising a polynucleotide having a nucleotide sequence selected from 
the group consisting of: (a) a nucleotide sequence encoding the ET2 or GABRE 
polypeptide having the complete amino acid sequence in SEQ ID NO: 2 or SEQ 
ID NO:42; (b) a nucleotide sequence encoding the ET2 or GABRE polypeptide 
having the complete amino acid sequence in SEQ ID NO:2 or SEQ ID NO:42 but 
minus the N-terminal methionine residue; (c) a nucleotide sequence encoding the 



GABRE polypeptide having the amino acid sequence at positions from about 1 to 
about 488 in SEQ ID NO:42; (d) a nucleotide sequence encoding the polypeptide 
having the amino acid sequence encoded by the cDNA clone contained in ATCC 

Deposit No. ; (e) a nucleotide sequence encoding the mature GABRE 

polypeptide having the amino acid sequence encoded by the cDNA clone 

contained in ATCC Deposit No. ; (f) a nucleotide sequence encoding one 

or more ET2 or GABRE transmembrane domains; (g) a nucleotide sequence 
encoding the ET2 or GABRE intracellular domain; (h) a nucleotide sequence 
encoding the GABRE extracellular domain; (i) a nucleotide sequence encoding the 
ET2 or GABRE protein with all or part of one of more of the transmembrane 
domains deleted; and (j) a nucleotide sequence complementary to any of the 
nucleotide sequences in (a), (b), (c), (d), (e), (f), (g), (h), or (i). 

Further embodiments of the invention include isolated nucleic acid 
molecules that comprise a polynucleotide having a nucleotide sequence at least 
95% identical, and more preferably at least 96%, 97%, 98% or 99% identical, to 
any of the nucleotide sequences in (a), (b), (c), (d), (e), (f), (g), (h), (i), or (j) 
above, or a polynucleotide which hybridizes under stringent hybridization 
conditions to a polynucleotide in (a), (b), (c), (d), (e), (f), (g), (h), (i), or (j) above. 
This polynucleotide which hybridizes does not hybridize under stringent 
hybridization conditions to a polynucleotide having a nucleotide sequence 
consisting of only A residues or of only T residues. An additional nucleic acid 
embodiment of the invention relates to an isolated nucleic acid molecule 
comprising a polynucleotide which encodes the amino acid sequence of an 
epitope-bearing portion of an ET2 polypeptide having an amino acid sequence in 
(a), (b), (c), (d), (e), (f), (g), (h), or (i) above. 

The present invention also relates to recombinant vectors, which include 
the isolated nucleic acid molecules of the present invention, and to host cells 
containing the recombinant vectors, as well as to methods of making such vectors 



and host cells and for using them for production of ET2 and GABRE 
polypeptides, and peptides thereof, by recombinant techniques. 

The invention further provides an isolated polypeptides having an amino 
acid sequence selected from the group consisting of: (a) the amino acid sequence 
of the ET2 polypeptide having the complete 242 amino acid sequence; (b) the 
amino acid sequence of the GABRE polypeptide having the complete 506 amino 
acid sequence; (c) the amino acid sequence of the ET2 polypeptide having the 
complete 242 amino acid sequence but minus the N-terminal methionine residue; 
(d) the amino acid sequence of the GABRE polypeptide having the complete 506 
amino acid sequence but minus the N-terminal methionine residue; (e) the amino 
acid sequence of the mature GABRE protein; (f) the amino acid sequence of one 
or more ET2 or GABRE transmembrane domains; (g) the amino acid sequence 
of the ET2 or GABRE intracellular domain; (h) the amino acid sequence of the 
GABRE extracellular domain; and (i) the amino acid sequence of the ET2 or 
GABRE protein with all or part of one or more of the transmembrane domains 
deleted. 

The polypeptides of the present invention also include polypeptides having 
an amino acid sequence at least 95% identical, and still more preferably 96%, 
97%, 98% or 99% identical to those described in (a), (b), (c), (d), (e), (f), (g), (h), 
or (i) above. 

An additional embodiment of this aspect of the invention relates to a 
peptide or polypeptide which has the amino acid sequence of an epitope-bearing 
portion of an ET2 or GABRE polypeptide having an amino acid sequence 
described in (a), (b), (c), (d), (e), (f), (g), (h), or (i) above. Peptides or 
polypeptides having the amino acid sequence of an epitope-bearing portion of an 
ET2 or GABRE polypeptide of the invention include portions of such 
polypeptides with at least six or seven, preferably at least nine, and more 
preferably at least about 30 amino acids to about 50 amino acids, although 
epitope-bearing polypeptides of any length up to and including the entire amino 



acid sequence of a polypeptide of the invention described above also are included 
in the invention. In another embodiment, the invention provides an isolated 
antibody that binds specifically to an ET2 or GABRE polypeptide having an amino 
acid sequence described in (a), (b), (c), (d), (e), (f), (g), (h), or (i) above. Such 
antibodies are useful diagnostically or therapeutically as described below. 

The present invention also provides a screening method for identifying 
compounds capable of enhancing or inhibiting an ET2 and/or GABRE receptor 
subunit activity, which involves contacting cells which express an ET2 and/or 
GABRE receptor subunit with both a stimulatory molecule and a candidate 
compound, assaying a cellular response, and comparing the cellular response to 
both a first standard cellular response, made in absence of the candidate 
compound, and a second standard cellular response, made in absence of an ET2 
and/or GABRE polypeptide, whereby an increased cellular response in comparison 
to both the first and second standards indicates that the compound is an agonist 
and a decreased cellular response in comparison to both the first and second 
standards indicates that the compound is an antagonist. 

In another aspect, a screening assay for agonists and antagonists is 
provided which involves determining the effect a candidate compound has on the 
binding of ligands to an ET2 and/or GABRE receptor subunit. In particular, the 
method involves contacting an ET2 and/or GABRE receptor subunit with one or 
more additional GAB A A receptor subunit polypeptides and a candidate compound 
and determining whether ET2 and/or GABRE receptor subunit ligand binding 
activity is increased or decreased due to the presence of the candidate compound. 

For a number of disorders involving GABA A receptors, it is believed that 
significantly higher or lower levels of ET2 and/or GABRE expression can be 
detected in certain tissues (e.g., heart, placenta, pancreas, spleen, thymus, 
prostate, testis, ovary, small intestine, and colon) or bodily fluids (e.g., serum, 
plasma, urine, synovial fluid or spinal fluid) taken from an individual having such 
a disorder, relative to a "standard" ET2 and/or GABRE receptor subunit 
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expression level, /. e. , an ET2 and/or GABRE receptor subunit expression level in 
tissue or bodily fluids from an individual not having the disorder. Thus, the 
invention provides a diagnostic method useful during diagnosis or prognosis of a 
disorder related to ET2 and/or GABRE expression levels which are significantly 
5 higher or lower than standard levels. This diagnostic method involves: (a) 

assaying the ET2 and/or GABRE receptor subunit expression level in cells or body 
fluid of an individual; (b) comparing the ET2 and/or GABRE expression level with 
a standard ET2 and/or GABRE receptor subunit expression level, whereby a 
significant increase or decrease in the assayed ET2 and/or GABRE receptor 

„4 0 subunit expression level compared to the standard expression level is indicative of 

a disorder involving ET2 and/or GABRE receptor subunit expression. 

y An additional aspect of the invention is related to a method for treating an 

individual in need of an increased level of ET2 and/or GABRE activity in the body 

lU comprising administering to such an individual a composition comprising a 

Pi 

s 15 therapeutically effective amount of an isolated ET2 and/or GABRE polypeptide 

51 of the invention or an agonist of ET2 and/or GABRE receptor subunit activity, 
ili A still further aspect of the invention is related to a method for treating an 

=!! individual in need of a decreased level of ET2 and/or GABRE receptor subunit 

w activity in the body comprising, administering to such an individual a composition 

20 comprising a therapeutically effective amount of an antagonist of ET2 and/or 
GABRE receptor subunit activity. 

Brief Description of the Figures 

FIG. 1 shows the genomic structure of GABRE. The top illustration 
represents the human e subunit mRNA. The protein-coding region is shaded. The 
25 start and end of exons are indicated by oblique lines that join the mRNA to the 

genomic DNA below. The representation of genomic DNA indicates the location 



of exons (boxes) and restriction sites for Kpnl and Xbal (K, X). The bottom 
portion of the figure illustrates the cloned fragments of human genomic DNA. 

FIG. 2 shows the alternative transcripts of GABRE. The top illustration 
depicts exons of GABRE that are transcribed to yield alternative mRNAs. A 3.6 
kb transcript is derived from 9 exons (open rectangles) and a 6.2 kb transcript is 
derived from 3 exons (filled rectangles). Fragments of GABRE (A, B, C, E) that 
were used to probe the RNA blots are also illustrated. 

FIG. 3 shows the conservation of nucleotide sequence between the human 
and rat e subunit genes. The sequence of GABRE between exons 4 and 9 was 
compared to the corresponding region of the homologous rat gene. The eight 
regions of highest similarity correspond to the six exons (numbered rectangles) 
and two segments of intron 6 (A and B). 

FIG. 4 shows the amino-acid sequence of the human e-subunit (SEQ ID 
NO:42). The e-subunit polypeptide is aligned with a consensus sequence of all 
known mammalian GABA A receptor subunits. The potential signal sequence (S) 
and four putative transmembrane domains (M1-M4) are highlighted by lines under 
the corresponding peptide sequences. Eight differences between the e-subunit 
sequence and the consensus are indicated by diamonds. 

FIG. 5A-5C show the effects of the e-subunit on activation of receptors 
by GABA. FIG. 5A shows the concentration-dependent activation of currents 
recorded from cells transfected with either a2/p 1 (open circles) or a2/p 1/e (filled 
circles) cDNAs. From the logistic fit (Adodra, S. and Hales, T. G., Br. J. 
Pharmacol. 775:953-960 (1 995)), the EC 50 values are 16.4 and 1 1 .2 |iM, and Hill 
slopes are 1.3 and 1.1, respectively. Data points represent current amplitudes 
recorded from at least 3 cells, normalized to the maximum current of each cell. 
FIG. 5B shows the current-voltage relationships for responses evoked by 100 \\M 
GABA, recorded from cells transfected with all, pi (open circles) or a2/pl/e 
(filled circles) cDNAs. From the respective exponential and linear fits to the data 
points (Adodra, S. and Hales, T. G., Br. J. Pharmacol 775:953-960 (1995)), 
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currents reversed at 4.4 and 5.7 mV. Data points represent the mean current 
amplitudes recorded from at least 4 different cells and normalized to the peak 
current amplitude of each cell. The insets are superimposed traces showing 
averages of two leak-subtracted currents recorded from individual cells when held 
at voltages between 60 and -60 mV. FIG. 5C shows the ratio of GABA (100 
(iM)-evoked current amplitudes, recorded at 60 and -60mV, for different a/p and 
a/p/e subunit combinations. Data are mean values from 4-18 cells. 

FIG. 6A-6D show the effects of the € subunit on the modulation of 
GABA A receptors by anaesthetic agents. FIG. 6A shows the modulation by 
pregnanolone of [ 35 S]TBPS binding to the membranes of cells transfected with 
either a2/pl (open circles) or a2/p/e (filled circles) cDNAs. FIG. 6B shows the 
modulation of GABA (1 00 (iM)-evoked currents by pregnanolone, recorded from 
cells expressing a2/pl (open circles) or a2/p/e (filled circles) cDNAs. Data 
points represent mean values from at least 4 cells, fitted using the logistic function 
(Adodra, S. and Hales, T. G., Br, J, Pharmacol 775:953-960 (1995)). The insets 
are superimposed traces showing currents recorded in the absence or presence 
(asterisks) of pregnanolone (100 nM). FIG. 6C shows the modulation of 
[ 3 H]muscimol binding to cell membranes by pregnanolone (circles) and 
pentobarbital (squares) after transfection with either al/p3 (open symbols) or 
a l/p3/e (filled symbols) cDNAs. FIG. 6D shows the effects of pentobarbital (1 00 
\iM) and propofol (3 |iM) on GABA (100 |aM) evoked currents recorded from 
cells expressing different oc/p and a/p/e subunit combinations. Histogram bars 
represent mean data from at least 4 cells. 

FIG. 7 shows a schematic representation of the pHE4a expression vector 
(SEQ ID NO:45). The locations of a number of restriction endonuclease cleavage 
sites, the kanamycin resistance marker gene, and the laclq coding sequence are 
indicated. 

FIG. 8 shows the nucleotide sequence of the regulatory elements of the 
pHE4a promoter (SEQ ID NO:46). The two lac operator sequences, the Shine- 



Delgarno sequence (S/D), and the terminal Hindlll and Ndel restriction sites 
(italicized) are indicated. 

Detailed Description 

The present invention provides isolated nucleic acid molecules comprising 
a polynucleotide encoding an ET2 polypeptide having the amino acid sequence 
shown in SEQ ID NO:2, which was determined by sequencing a PCR amplified 
cDNA clone, as described in Example 1 . The present invention also provides 
isolated nucleic acid molecules comprising a polynucleotide encoding a GABRE 
polypeptide having the amino acid sequence shown in SEQ ID NO :42, which was 
determined as described in Example 6. The amino acid sequence of the ET2 
polypeptide of the present invention is identical to approximately the carboxy 
terminal half of GABRE (amino acids from about 247 to about 488 in SEQ ID 
NO:42). 

The present invention further provides isolated nucleic acid molecules 
comprising a polynucleotide encoding a GABRE polypeptide having the amino 
acid sequence shown in SEQ ID NO:42. A cDNA clone containing a nucleotide 
sequence corresponding to nucleotides 11-1593 in SEQ IDNO:41 was deposited 
onFebruary25, 1998 at the American Type Culture Collection, 12301 Park Lawn 

Drive, Rockville, Maryland 20852, and given accession number . The 

deposited clone is inserted in the pCDM8 plasmid (Invitrogen, San Diego, CA) 
using the Xhol and Pstl restriction endonuclease cleavage sites. The Xhol site was 
lost during the insertion process. 

The material identified as ATCC accession number was deposited 

as plasmid DNA which must be inserted into a suitable host cell for bacterial 
amplification. One suitable host cell is Escherichia coli, and the deposited vector 
confers ampicillin and tetracycline resistance in transformed E. coli containing the 
P3 episome. 
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Nucleic Acid Molecules 

All amino acid sequences of polypeptides encoded by DNA molecules 
determined herein were predicted by translation of cDNA sequences, which were 
sequenced as described in Examples 1 and 6. As is known in the art, any 
nucleotide sequence determined herein may contain some errors. As is also 
known in the art, a single insertion or deletion in a determined nucleotide sequence 
compared to the actual sequence will cause a frame shift in translation of the 
nucleotide sequence such that the predicted amino acid sequence encoded by a 
determined nucleotide sequence will be completely different from the amino acid 
sequence actually encoded by the sequenced DNA molecule, beginning at the 
point of such an insertion or deletion. 

Unless otherwise indicated, each "nucleotide sequence" set forth herein is 
presented as a sequence of deoxyribonucleotides (abbreviated A, G 5 C and T). 
However, by "nucleotide sequence" of a nucleic acid molecule or polynucleotide 
is intended, for a DNA molecule or polynucleotide, a sequence of 
deoxyribonucleotides, and for an RNA molecule or polynucleotide, the 
corresponding sequence of ribonucleotides (A, G, C andU), where each thymidine 
deoxyribonucleotide (T) in the specified deoxyribonucleotide sequence is replaced 
by the ribonucleotide uridine (U). For instance, reference to an RNA molecule 
having the sequence of SEQ ID NO:l set forth using deoxyribonucleotide 
abbreviations is intended to indicate an RNA molecule having a sequence in which 
each deoxyribonucleotide A, G or C of SEQ ID NO:l has been replaced by the 
corresponding ribonucleotide A, G or C, and each deoxyribonucleotide T has been 
replaced by a ribonucleotide U. 

Using the information provided herein, such as the nucleotide sequence in 
SEQ ID NO : 1 or SEQ ID NO :4 1 , a nucleic acid molecule of the present invention 
encoding an ET2 or GABRE polypeptide may be obtained using standard cloning 
and screening procedures, such as those for cloning cDNAs using mRNA as 
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starting material. Illustrative of the invention, the nucleic acid molecule described 
in SEQ ID NO:l was identified in multiple tissues, including heart, placenta, 
pancreas, spleen, thymus, prostate, testis, ovary, small intestine, and colon. The 
determined nucleotide sequence of the ET2 cDNA of SEQ ID NO: 1 contains an 
open reading frame encoding a protein of about 242 amino acid residues. The 
amino acid sequence of the predicted ET2 receptor subunit is shown in SEQ ID 
NO:2 from amino acid residue about 1 to residue about 242. 

Also illustrative of the invention, the nucleic acid molecule described in 
SEQ ID NO:41 was identified in a number of brain tissues, including amygdala 
and thalamus. The determined nucleotide sequence of the GABRE cDNA of SEQ 
ID NO:41 contains an open reading frame encoding a protein of about 506 amino 
acid residues. The amino acid sequence of the predicted GABRE receptor subunit 
is shown in SEQ ID NO:42 from amino acid residue about -18 to residue about 
488. 

The present invention also provides the mature form(s) of the GABRE 
protein of the present invention. According to the signal hypothesis, proteins 
secreted by mammalian cells have a signal or secretory leader sequence which is 
cleaved from the mature protein once export of the growing protein chain across 
the rough endoplasmic reticulum has been initiated. Most mammalian cells and 
even insect cells cleave secreted proteins with the same specificity. However, in 
some cases, cleavage of a secreted protein is not entirely uniform, which results 
in two or more mature species on the protein. Further, it has long been known 
that the cleavage specificity of a secreted protein is ultimately determined by the 
primary structure of the complete protein, that is, it is inherent in the amino acid 
sequence of the polypeptide. Therefore, the present invention provides a 
nucleotide sequence encoding the mature GABRE polypeptide having the amino 
acid sequence encoded by the cDNA clone contained in the plasmid identified as 

ATCC Deposit No. and as shown in SEQ ID NO:42. By the mature 

GABRE protein having the amino acid sequence encoded by the cDNA clone 
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contained in the plasmid identified as ATCC Deposit No. is meant the 

mature form(s) of the GABRE protein produced by expression in a mammalian 
cell (e.g., COS cells, as described below) of the complete open reading frame 
encoded by the human DNA sequence of the clone contained in the vector in the 
deposited host. As indicated below, the mature GABRE protein having the amino 
acid sequence encoded by the cDNA clone contained in ATCC Deposit No. 

may or may not differ from the predicted "mature" GABRE protein 

shown in SEQ ID NO:42 (amino acids from about 1 to about 488) depending on 
the accuracy of the predicted cleavage site based on computer analysis. 

Methods for predicting whether a protein has a secretory leader as well as 
the cleavage point for that leader sequence are available. For instance, the 
methods of McGeoch (Virus Res. 3:271-286 (1985)) and von Heinje (Nucleic 
Acids Res. 74:4683-4690 (1986)) can be used. The accuracy of predicting the 
cleavage points of known mammalian secretory proteins for each of these methods 
is in the range of 75-80%. von Heinje, supra. However, the two methods do not 
always produce the same predicted cleavage point(s) for a given protein. 

The leader sequence for the GABRE protein is predicted to consist of 
amino acid residues from about -18 to about -1 in SEQ ID NO:42, while the 
mature GABRE protein is predicted to consist of residues from about 1 to about 
488. 

The ET2 receptor subunit is believed to be translated from a mRNA splice 
variant of the GABRE genomic coding sequence. Alternative splicing has 
previously been demonstrated for several subunits of the GABA A receptor 
complex (reviewed in Whiting, P.J., et al, Int. Rev. Neurobiol 35:95-138 
(1995)). 

As used herein the phrase "splice variant" refers to RNA molecules initially 
transcribed from the same genomic DNA sequence but have undergone alternative 
splicing. Alternative splicing occurs when the sequence of a primary RNA 
transcript is spliced, wherein generally one or more introns are removed, and more 



-15- 



than one mRNA molecule is produced. Further, each of the mRNA molecules 
may encode different amino acid sequences. The term "splice variant" also refers 
to the proteins encoded by the above RNA molecules produced by alternative 
splicing. 

As indicated, nucleic acid molecules of the present invention may be in the 
form of RNA, such as mRNA, or in the form of DNA, including, for instance, 
cDNA and genomic DNA obtained by cloning or produced synthetically. The 
DNA may be double-stranded or single-stranded. Single-stranded DNA or RNA 
may be the coding strand, also known as the sense strand, or it may be the 
non-coding strand, also referred to as the anti-sense strand. 

By "isolated" nucleic acid molecule(s) is intended a nucleic acid molecule, 
DNA or RNA, which has been removed from its native environment For example, 
recombinant DNA molecules contained in a vector are considered isolated for the 
purposes of the present invention. Further examples of isolated DNA molecules 
include recombinant DNA molecules maintained in heterologous host cells or 
purified (partially or substantially) DNA molecules in solution. Isolated RNA 
molecules include in vivo or in vitro RNA transcripts of the DNA molecules of the 
present invention. Isolated nucleic acid molecules according to the present 
invention further include such molecules produced synthetically. 

Isolated nucleic acid molecules of the present invention include DNA 
molecules comprising an open reading frame (ORF) with an initiation codon at 
positions 3872-3874 of the nucleotide sequence shown in SEQ ID NO:l or at 
positions 41-43 of the nucleotide sequence shown in SEQ ID NO:41 and DNA 
molecules which comprise sequences substantially different from those described 
above but which, due to the degeneracy of the genetic code, still encode an ET2 
or GABRE receptor subunit. Of course, the genetic code is well known in the art. 
Thus, it would be routine for one skilled in the art to generate such degenerate 
variants. 
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In another aspect, the invention provides isolated nucleic acid molecules 
encoding the GABRE polypeptide having an amino acid sequence as encoded by 

the cDNA clone contained in the plasmid deposited as ATCC Deposit No. 

on February 25, 1998. In a further aspect, nucleic acid molecules are provided 
encoding the full-length ET2 or GABRE polypeptide lacking the N-terminal 
methionine. The invention also provides an isolated nucleic acid molecule having 
the nucleotide sequence shown in SEQ ID NO:l or SEQ ID NO:41, or a nucleic 
acid molecule having a sequence complementary to that of SEQ ID NO: 1 or SEQ 
ID NO:41 . Such isolated molecules, particularly DNA molecules, are useful as 
probes for gene mapping, by in situ hybridization with chromosomes, and for 
detecting expression of ET2 or GABRE nucleotide sequences in human tissue, for 
instance, by Northern blot analysis. 

The present invention is further directed to fragments of the isolated 
nucleic acid molecules described herein. By a fragment of an isolated nucleic acid 
molecule having the nucleotide sequence shown in SEQ ID NO:l or SEQ ID 
NO:41 is intended fragments at least about 15 nt, and more preferably at least 
about 20 nt, still more preferably at least about 30 nt, and even more preferably, 
at least about 40 nt in length which are useful as diagnostic probes and primers as 
discussed herein. Of course larger DNA fragments 50, 100, 150, 200, 250, 300, 
350, 400, 450, 500, 550, 600, 650, 700, 750, 800, 850, 900, 950, 1000, 1050, 
1100, 1150, 1200, 1250, 1300, 1350, 1400, 1450, 1500, 1550, 1600, 1650, 1700, 
1750, 1800, 1850, 1900, 1950,2000,2050,2100,2150,2200,2250, 2300,2350, 
2400, 2450, 2500, 2550, 2600, 2650, 2700, 2750, 2800, 2850, 2900, 2950, 3000, 
3050, 3100, 3150, 3200, 3250, 3300, 3350, 3400, 3450, 3500, 3550, 3600, 3650, 
3700, 3750, 3800, 3850, 3900, 3950, 4000, 4050, 4100, 4150, 4200, 4250, 4300, 
4350, 4400, 4450, 4500, 4550, 4600, 4650, 4700, 4750, 4800, 4850, 4900, 4950, 
5000, 5050, 5100, 5150, 5200, 5250, 5300, 5350, 5400, 5450, 5500, 5550, 5600, 
5650, 5700, 5750, 5800, 5850, 5900, 5950, 6000, 6050, 6100, or 6146 nt in 
length of the sequence shown in SEQ ID NO:l are also useful according to the 
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present invention. Similarly, larger DNA fragments 50, 100, 1 50, 200, 250, 300, 
350, 400, 450, 500, 550, 600, 650, 700, 750, 800, 850, 900, 950, 1000, 1050, 
1100, 1150, 1200, 1250, 1300, 1350, 1400, 1450, 1500, 1550, 1600, 1650, 1700, 
1750, 1800, 1850, 1900, 1950,2000,2050,2100,2150,2200, 2250,2300, 2350, 
2400, 2450, 2500, 2550, 2600, 2650, 2700, 2750, 2800, 2850, 2900, 2950, 3000, 
3050, or 3100 nt in length of the sequence shown in SEQ ID NO:41 are also 
useful according to the present invention as are fragments corresponding to most, 
if not all, of the nucleotide sequence of the cDNA clone contained in the plasmid 

deposited as ATCC Deposit No. or as shown in SEQ ID NO:41. 

By a fragment at least 20 nt in length, for example, is intended fragments 
which include 20 or more contiguous bases from the nucleotide sequence as 
shown in SEQ ID NO:l or SEQ ID NO:41. Since the nucleotide sequences 
shown in SEQ ID NO:l and SEQ ID NO:41 are provided, generating such DNA 
fragments would be routine to the skilled artisan. For example, restriction 
endonuclease cleavage or shearing by sonication could easily be used to generate 
fragments of various sizes. Alternatively, such fragments could be generated 
synthetically. 

Preferred nucleic acid fragments of the present invention include nucleic 
acid molecules encoding polypeptides comprising the ET2 receptor subunit 
transmembrane domains (predicted to constitute amino acid residues from about 
15 to about 36, from about 42 to about 62, from about 75 to about 95, and from 
about 219 to about 240 in SEQ ID NO:2) and the ET2 receptor subunit 
intracellular domain (predicted to constitute amino acid residues from about 96 to 
about 218 in SEQ ID NO:2). Preferred nucleic acid fragments of the present 
invention further include nucleic acid molecules encoding polypeptides comprising 
the GABRE receptor subunit transmembrane domains (predicted to constitute 
amino acid residues from about 261 to about 282, from about 288 to about 308, 
from about 321 to about 341, and from about 465 to about 486 in SEQ ID 
NO: 42), the GABRE receptor subunit extracellular domain (predicted to 
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constitute amino acid residues from about 1 to about 260 in SEQ ID NO:42), and 
the GABRE receptor subunit intracellular domain (predicted to constitute amino 
acid residues from about 342 to about 464 in SEQ ID NO:42). The amino acid 
residues constituting the ET2 and GABRE receptor subunit domains have been 
predicted by computer analysis. Thus, as one of ordinary skill would appreciate, 
the amino acid residues constituting these domains may vary slightly (e.g., by 
about 1 to about 3 amino acid residues) depending on the criteria used to define 
each domain. 

Preferred nucleic acid fragments of the present invention also include 
nucleic acid molecules encoding epitope-bearing portions of the ET2 and GABRE 
receptor subunit proteins. Methods for determining such epitope-bearing portions 
of the ET2 and GABRE proteins are described in detail below. 

In addition, the invention present inventors have identified the following 
cDNA clones related to extensive portions of SEQ ID NO: 1 : HUKAU66R (SEQ 
ID NO:43) and HPLBB96F (SEQ ID NO:44). 

In another aspect, the invention provides isolated nucleic acid molecules 
comprising polynucleotides which hybridizes under stringent hybridization 
conditions to a portion of the polynucleotide in a nucleic acid molecule of the 
invention described above, for instance, the cDNA sequences shown in SEQ ID 
NO:l and SEQ ID NO:41, or the cDNA clone contained in ATCC Deposit No. 

. By "stringent hybridization conditions" is intended overnight 

incubation at 42°C in a solution comprising: 50% formamide, 5x SSC (150 mM 
NaCl, 1 5mM trisodium citrate), 50 mM sodium phosphate (pH 7.6), 5x Denhardt's 
solution, 10% dextran sulfate, and 20 g/ml denatured, sheared salmon sperm 
DNA, followed by washing the filters in O.lx SSC at about 65 °C. 

By a polynucleotide which hybridizes to a "portion" of a polynucleotide 
is intended a polynucleotide (either DNA or RNA) hybridizing to at least about 
1 5 nucleotides (nt), and more preferably at least about 20 nt, still more preferably 
at least about 30 nt, and even more preferably about 30-70 nt of the reference 
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polynucleotide. These are useful as diagnostic probes and primers as discussed 
above and in more detail below. 

Of course, polynucleotides hybridizing to a larger portion of the reference 
polynucleotide (e.g., the cDNA sequence shown in SEQ ID NO:l or SEQ ID 
NO:41), for instance, a portion 50-750 nt in length, or even to the entire length 
of the reference polynucleotide, are also useful as probes according to the present 
invention, as are polynucleotides corresponding to most, if not all, of the 
nucleotide sequence shown in SEQ ID NO:l or SEQ ID NO:41 . By a portion of 
a polynucleotide of "at least 20 nt in length," for example, is intended 20 or more 
contiguous nucleotides from the nucleotide sequence of the reference 
polynucleotide (e.g., the nucleotide sequence as shown in SEQ ID NO:l or SEQ 
ID NO:41). As indicated, such portions are useful diagnostically either as aprobe 
according to conventional DNA hybridization techniques or as primers for 
amplification of a target sequence by the polymerase chain reaction (PCR), as 
described, for instance, in Molecular Cloning, A Laboratory Manual, 2nd. edition, 
Sambrook, J., Fritsch, E. F. and Maniatis, T., eds., Cold Spring Harbor 
Laboratory Press, Cold Spring Harbor, N.Y. (1 989), the entire disclosure of which 
is hereby incorporated herein by reference. 

Since ET2 and GABRE receptor subunit cDNA sequences are provided 
in SEQ ID NO: 1 and SEQ ID NO:41 , generating polynucleotides which hybridize 
to a portion of the ET2 or GABRE receptor subunit cDNA molecule would be 
routine to the skilled artisan. For example, restriction endonuclease cleavage or 
shearing by sonication of an ET2 or GABRE receptor subunit cDNA clone could 
easily be used to generate DNA portions of various sizes which are 
polynucleotides that hybridize to a portion of the ET2 or GABRE receptor subunit 
cDNA molecule. Alternatively, the hybridizing polynucleotides of the present 
invention could be generated synthetically according to known techniques. Of 
course, a polynucleotide which hybridizes only to a poly A sequence (such as the 
3 f terminal poly (A) tract of the ET2 or GABRE receptor subunit cDNA shown in 
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SEQ ID NO:l and SEQ ID NO:41, respectively), or to a complementary stretch 
of T (or U) resides, would not be included in a polynucleotide of the invention 
used to hybridize to a portion of a nucleic acid of the invention, since such a 
polynucleotide would hybridize to any nucleic acid molecule containing apoly (A) 
5 stretch or the complement thereof (e.g., practically any double-stranded cDNA 

clone). 

As indicated, nucleic acid molecules of the present invention which encode 
an ET2 or GABRE polypeptide may include, but are not limited to the coding 
sequence for the polypeptide and additional sequences, such as those encoding a 
10 leader or secretory sequence, such as a pre-, or pro- or prepro- protein sequence; 

the coding sequence of the polypeptide, with or without the aforementioned 
additional coding sequences, together with additional, non-coding sequences, 
including for example, but not limited to introns and non-coding 5' and 3 ! 
fil sequences, such as the transcribed, non-translated sequences that play a role in 

Q 1 5 transcription, mRNA processing, including splicing and poly adenylation signals, 

for example - ribosome binding and stability of mRNA; an additional coding 
sequence which codes for additional amino acids, such as those which provide 
additional functionalities. Thus, the sequence encoding the polypeptide may be 
fused to a marker sequence, such as a sequence encoding a peptide which 
20 facilitates purification of the fused polypeptide. In certain preferred embodiments 

of this aspect of the invention, the marker amino acid sequence is a hexa-histidine 
peptide, such as the tag provided in a pQE vector (Qiagen, Inc.), among others, 
many of which are commercially available. As described in Gentz et at, Proa 
Natl Acad Set USA 55:821-824 (1989), for instance, hexa-histidine provides for 
25 convenient purification of the fusion protein. The "HA" tag is another peptide 

useful for purification which corresponds to an epitope derived from the influenza 
hemagglutinin protein, which has been described by Wilson et al., Cell 57:767-778 
(1 984). As discussed below, other such fusion proteins include the ET2 receptor 
subunit fused to Fc at the C- or N-terminus. 



n 
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The present invention further relates to variants of the nucleic acid 
molecules of the present invention, which encode portions, analogs or derivatives 
of an ET2 or GABRE receptor subunit Variants may occur naturally, such as a 
natural allelic variant. By an "allelic variant" is intended one of several alternate 
forms of a gene occupying a given locus on a chromosome of an organism. Genes 
II, Lewin, B., ed., John Wiley & Sons, New York (1985). Non-naturally 
occurring variants may be produced using art-known mutagenesis techniques. 

Such variants include those produced by nucleotide substitutions, deletions 
or additions. The substitutions, deletions or additions may involve one or more 
nucleotides. The variants may be altered in coding regions, non-coding regions, 
or both. Alterations in the coding regions may produce conservative or 
non-conservative amino acid substitutions, deletions or additions. Especially 
preferred among these are silent substitutions, additions and deletions, which do 
not alter the properties and activities of an ET2 or GABRE receptor subunit or 
portions thereof. Also especially preferred in this regard are conservative 
substitutions. Most highly preferred are nucleic acid molecules encoding the 
protein having the amino acid sequence shown in SEQ ID NO:2 or SEQ ID 
NO:42. 

Further embodiments of the invention include isolated nucleic acid 
molecules comprising a polynucleotide having a nucleotide sequence at least 95% 
identical, and more preferably at least 96%, 97%, 98% or 99% identical to (a) a 
nucleotide sequence encoding the polypeptide having the amino acid sequence in 
SEQ ID NO:2 or SEQ ID NO:42; (b) a nucleotide sequence encoding the 
polypeptide having the amino acid sequence in SEQ ID NO:2 or SEQ ID NO:42, 
but lacking the N-terminal methionine; (c) a nucleotide sequence encoding amino 
acids from about 1 to about 488 sequence in SEQ ID NO:42; (d) a nucleotide 
sequence encoding the polypeptide having the amino acid sequence encoded by 

the cDNA clone contained in ATCC Deposit No. ; (e) a nucleotide 

sequence encoding the mature GABRE polypeptide having the amino acid 
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sequence encoded by the cDNA clone contained in ATCC Deposit No. ; 

(f) a nucleotide sequence encoding one or more ET2 or GABRE receptor subunit 
transmembrane domains; (g) a nucleotide sequence encoding the ET2 or GABRE 
intracellular domain; (h) a nucleotide sequence encoding the ET2 or GABRE 
protein with all or part of one or more of the transmembrane domains deleted; (i) 
a nucleotide sequence encoding the GABRE receptor subunit extracellular 
domain; and (j) a nucleotide sequence complementary to any of the nucleotide 
sequences in (a), (b), (c), (d), (e), (f), (g), (h), or (i). 

By a polynucleotide having a nucleotide sequence at least, for example, 
95% "identical" to a reference nucleotide sequence encoding an ET2 or GABRE 
polypeptide is intended that the nucleotide sequence of the polynucleotide is 
identical to the reference sequence except that the polynucleotide sequence may 
include up to five point mutations per each 100 nucleotides of the reference 
nucleotide sequence encoding the ET2 or GABRE receptor subunit. In other 
words, to obtain a polynucleotide having a nucleotide sequence at least 95% 
identical to a reference nucleotide sequence, up to 5% of the nucleotides in the 
reference sequence may be deleted or substituted with another nucleotide, or a 
number of nucleotides up to 5% of the total nucleotides in the reference sequence 
may be inserted into the reference sequence. These mutations of the reference 
sequence may occur at the 5 f or 3' terminal positions of the reference nucleotide 
sequence or anywhere between those terminal positions, interspersed either 
individually among nucleotides in the reference sequence or in one or more 
contiguous groups within the reference sequence. 

As a practical matter, whether any particular nucleic acid molecule is at 
least 95%, 96%, 97%, 98% or 99% identical to, for instance, the nucleotide 
sequence shown in SEQ ID NO:l or SEQ ID NO:41 can be determined 
conventionally using known computer programs such as the Bestfit program 
(Wisconsin Sequence Analysis Package, Version 8 for Unix, Genetics Computer 
Group, University Research Park, 575 Science Drive, Madison, WI 53711). 
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Bestfit uses the local homology algorithm of Smith and Waterman, Advances in 
Applied Mathematics 2:482-489 (1981), to find the best segment of homology 
between two sequences. When using Bestfit or any other sequence alignment 
program to determine whether a particular sequence is, for instance, 95% identical 
to a reference sequence according to the present invention, the parameters are set, 
of course, such that the percentage of identity is calculated over the full length of 
the reference nucleotide sequence and that gaps in homology of up to 5% of the 
total number of nucleotides in the reference sequence are allowed. 

The present application is directed to nucleic acid molecules at least 95%, 
96%, 97%, 98% or 99% identical to the nucleic acid sequence shown in SEQ ID 
NO:l or SEQ ID NO:41, irrespective of whether they encode a polypeptide 
having ET2 or GABRE receptor subunit activity. This is because even where a 
particular nucleic acid molecule does not encode a polypeptide having ET2 or 
GABRE receptor subunit activity, one of skill in the art would still know how to 
use the nucleic acid molecule, for instance, as a hybridization probe or a 
polymerase chain reaction (PCR) primer. Uses of the nucleic acid molecules of 
the present invention that do not encode a polypeptide having ET2 or GABRE 
receptor subunit activity include, inter alia, (1) isolating the ET2 or GABRE 
receptor subunit nucleotide sequence, or allelic variants thereof, in a cDNA 
library; (2) in situ hybridization (e.g. , "FISH") to metaphase chromosomal spreads 
to provide precise chromosomal location of the gene encoding the ET2 and 
GABRE receptor subunits, as described in Verma et al, Human Chromosomes: 
A Manual of Basic Techniques, Pergamon Press, New York (1988); and (3) 
Northern blot analysis for detecting ET2 or GABRE receptor subunit mRNA 
expression in specific tissues. 

Preferred, however, are nucleic acid molecules having sequences at least 
95%, 96%, 97%, 98% or 99% identical to the nucleic acid sequence shown in 
SEQ ID NO:l or SEQ ID NO:41 which do, in fact, encode a polypeptide having 
ET2 or GABRE receptor subunit activity. By "a polypeptide having ET2 or 
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GABRE receptor subunit activity" is intended polypeptides exhibiting activity 
similar, but not necessarily identical, to an activity of the ET2 or GABRE receptor 
subunit of the invention (e.g., the full-length protein or a functional domain 
thereof), as measured in a particular biological assay. For example, ET2 or 
GABRE receptor subunit activity may be measured by co-transfection of cells with 
GABA A receptor a and P subunits and a polypeptide suspected of having ET2 or 
GABRE receptor activity. After transfection, the effect that the candidate 
polypeptide has on the activation of the GABA A receptor complex can be 
determined by measurement of concentration-dependent activation of chloride 
currents as described in Davies, P. et aL, Nature 555:820-823 (1997), and 
Example 6, in the presence and absence of the candidate polypeptide. Additional 
assays for ET2 and GABRE activities are described in Davies, P. et al, Br. J. 
Pharmacol. 720:899-909 (1997), Davies, P. etal, Nature 355:820-823 (1997), 
and Example 6. 

Further, since the ET2 and GABRE polypeptide are each believed to 
assemble into GABA A receptor complexes, ET2 or GABRE receptor subunit 
activity can be measured by determining the binding affinity of a candidate 
polypeptide towards other members of this complex. In vivo and in vitro 
techniques for measuring such binding activity are described in Ausbel, F. et al , 
eds., Current Protocols in Molecular Biology, John Wiley & Sons, Inc. (1997), 
Chapter 20. These techniques include the use of interaction trap/two hybrid 
assays and binding assays involving GST fusion proteins. 

Thus, "a polypeptide having ET2 or GABRE receptor subunit activity" 
includes polypeptides that exhibit ET2 or GABRE receptor subunit activity, in the 
above-described assays. Although the degree of activity need not be identical to 
that of the ET2 or GABRE receptor subunit, preferably, "a polypeptide having 
ET2 or GABRE receptor subunit activity" will exhibit substantially similar activity 
as compared to the ET2 or GABRE protein {i.e., the candidate polypeptide will 
exhibit greater activity or not more than about twenty-fold less and, preferably, 
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not more than about ten-fold less activity relative to the reference ET2 or GABRE 
protein). 

Of course, due to the degeneracy of the genetic code, one of ordinary skill 
in the art will immediately recognize that a large number of the nucleic acid 
molecules having a sequence at least 95%, 96%, 97%, 98%, or 99% identical to 
the nucleic acid sequence shown in SEQ ID NO: 1 or SEQ ID NO:41 will encode 
"a polypeptide having ET2 or GABRE receptor subunit activity." In fact, since 
degenerate variants of these nucleotide sequences all encode the same polypeptide, 
this will be clear to the skilled artisan even without performing the above 
described comparison assay. It will be further recognized in the art that, for such 
nucleic acid molecules that are not degenerate variants, a reasonable number will 
also encode a polypeptide having ET2 or GABRE protein activity. This is so 
because the skilled artisan is fully aware of amino acid substitutions that are either 
less likely or not likely to significantly effect protein function (e.g. , replacing one 
aliphatic amino acid with a second aliphatic amino acid). 

For example, guidance concerning how to make phenotypically silent 
amino acid substitutions is provided in Bowie, J. U. et al, "Deciphering the 
Message in Protein Sequences: Tolerance to Amino Acid Substitutions," Science 
247:1306-13 10 (1990), wherein the authors indicate that there are two main 
approaches for studying the tolerance of an amino acid sequence to change. The 
first method relies on the process of evolution, in which mutations are either 
accepted or rejected by natural selection. The second approach uses genetic 
engineering to introduce amino acid changes at specific positions of a cloned gene 
and selections or screens to identify sequences that maintain functionality. As the 
authors state, these studies have revealed that proteins are surprisingly tolerant of 
amino acid substitutions. The authors further indicate which amino acid changes 
are likely to be permissive at a certain position of the protein. For example, most 
buried amino acid residues require nonpolar side chains, whereas few features of 
surface side chains are generally conserved. Other such phenotypically silent 
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substitutions are described in Bowie, J. U. et at, supra, and the references cited 
therein. 

Vectors and Host Cells 

The present invention also relates to vectors which include the isolated 
DNA molecules of the present invention, host cells which are genetically 
engineered with the recombinant vectors, and the production of ET2 or GAB RE 
polypeptides or fragments thereof by recombinant techniques. 

Recombinant constructs may be introduced into host cells using well 
known techniques such as infection, transduction, transfection, transvection, 
electroporation and transformation. The vector may be, for example, a phage, 
plasmid, viral or retroviral vector. Retroviral vectors may be replication 
competent or replication defective. In the latter case, viral propagation generally 
will occur only in complementing host cells. 

The polynucleotides may be joined to a vector containing a selectable 
marker for propagation in a host. Generally, a plasmid vector is introduced in a 
precipitate, such as a calcium phosphate precipitate, or in a complex with a 
charged lipid. If the vector is a virus, it may be packaged in vitro using an 
appropriate packaging cell line and then transduced into host cells. 

Preferred are vectors comprising c/s-acting control regions to the 
polynucleotide of interest. Appropriate trans-acting factors may be supplied by 
the host, supplied by a complementing vector or supplied by the vector itself upon 
introduction into the host. 

In certain preferred embodiments in this regard, the vectors provide for 
specific expression, which may be inducible and/or cell type-specific. Particularly 
preferred among such vectors are those inducible by environmental factors that are 
easy to manipulate, such as temperature and nutrient additives. 
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Expression vectors useful in the present invention include chromosomal-, 
episomal- and virus-derived vectors, e.g. , vectors derived from bacterial plasmids, 
bacteriophage, yeast episomes, yeast chromosomal elements, viruses such as 
baculoviruses, papova viruses, vaccinia viruses, adenoviruses, fowl pox viruses, 
pseudorabies viruses and retroviruses, and vectors derived from combinations 
thereof, such as cosmids and phagemids. 

The DNA insert should be operatively linked to an appropriate promoter, 
such as the phage lambda PL promoter, the E. coli lac, trp and tac promoters, the 
S V40 early and late promoters and promoters of retroviral LTRs, to name a few. 
Other suitable promoters will be known to the skilled artisan. The expression 
constructs will further contain sites for transcription initiation, termination and, in 
the transcribed region, a ribosome binding site for translation. The coding portion 
of the transcripts expressed by the constructs will preferably include a translation 
initiating at the beginning and a termination codon (UAA, UGA or UAG) 
appropriately positioned at the end of the polypeptide to be translated. 

As indicated, the expression vectors will preferably include at least one 
selectable marker. Such markers include dihydrofolate reductase or neomycin 
resistance for eukaryotic cell culture and tetracycline or ampicillin resistance genes 
for culturing in E. coli and other bacteria. Representative examples of appropriate 
hosts include, but are not limited to, bacterial cells, such as E. coli^ Streptomyces 
and Salmonella typhimurium cells; fungal cells, such as yeast cells; insect cells 
such as Drosophila S2 and Spodoptera Sf9 cells; animal cells such as CHO, COS 
and Bowes melanoma cells; and plant cells. Appropriate culture mediums and 
conditions for the above-described host cells are known in the art. 

Among vectors preferred for use in bacteria include pQE70, pQE60 and 
pQE-9, available from Qiagen; pBS vectors, Phagescript vectors, Bluescript 
vectors, pNH8A, pNH16a, pNH18A, pNH46A, available from Stratagene; and 
ptrc99a, pKK223-3, pKK233-3, pDR540, pRIT5 available from Pharmacia. 
Among preferred eukaryotic vectors are pWLNEO, pSV2CAT, pOG44, pXTl 
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and pSG available from Stratagene; and pSVK3, pBPV, pMSG and pSVL 
available from Pharmacia. Other suitable vectors will be readily apparent to the 
skilled artisan. 

Among known bacterial promoters suitable for use in the present invention 
include the E. coli lacl and lacZ promoters, the T3 and T7 promoters, the gpt 
promoter, the lambda PR and PL promoters and the trp promoter. Suitable 
eukaryotic promoters include the CMV immediate early promoter, the HSV 
thymidine kinase promoter, the early and late S V40 promoters, the promoters of 
retroviral LTRs, such as those of the Rous sarcoma virus (RSV), and 
metallothionein promoters, such as the mouse metallothionein-I promoter. 

Introduction of the construct into the host cell can be effected by calcium 
phosphate transfection, DEAE-dextran mediated transfection, cationic 
lipid-mediated transfection, electroporation, transduction, infection or other 
methods. Such methods are described in many standard laboratory manuals, such 
as Davis et ah, Basic Methods In Molecular Biology (1986). 

Transcription of the DNA encoding the polypeptides of the present 
invention by higher eukaryotes may be increased by inserting an enhancer 
sequence into the vector. Enhancers are czs-acting elements of DNA, usually 
about from 10 to 300 bp that act to increase transcriptional activity of a promoter 
in a given host cell-type. Examples of enhancers include the SV40 enhancer, 
which is located on the late side of the replication origin at bp 100 to 270, the 
cytomegalovirus early promoter enhancer, the polyoma enhancer on the late side 
of the replication origin, and adenovirus enhancers. 

For secretion of the translated protein into the lumen of the endoplasmic 
reticulum, into the periplasmic space or into the extracellular environment, 
appropriate secretion signals may be incorporated into the expressed polypeptide. 
The signals may be endogenous to the polypeptide or they may be heterologous 
signals. 
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The polypeptide may be expressed in a modified form, such as a fusion 
protein, and may include not only secretion signals, but also additional 
heterologous functional regions. For instance, a region of additional amino acids, 
particularly charged amino acids, may be added to the N-terminus of the 
polypeptide to improve stability and persistence in the host cell, during 
purification, or during subsequent handling and storage. Also, peptide moieties 
may be added to the polypeptide to facilitate purification. Such regions may be 
removed prior to final preparation of the polypeptide. The addition of peptide 
moieties to polypeptides to engender secretion or excretion, to improve stability 
and to facilitate purification, among others, are familiar and routine techniques in 
the art. A preferred fusion protein comprises a heterologous region from 
immunoglobulin that is useful to solubilize proteins. For example, EPA 0 464 5 3 3 
(Canadian counterpart 2045869) discloses fusion proteins comprising various 
portions of constant region of immunoglobin molecules together with another 
human protein or part thereof. In many cases, the Fc part in a fusion protein is 
thoroughly advantageous for use in therapy and diagnosis and thus results, for 
example, in improved pharmacokinetic properties (EPA 0 232 262). On the other 
hand, for some uses it would be desirable to be able to delete the Fc part after the 
fusion protein has been expressed, detected and purified in the advantageous 
manner described. This is the case when Fc portion proves to be a hindrance to 
use in therapy and diagnosis, for example when the fusion protein is to be used as 
antigen for immunizations. In drug discovery, for example, human proteins, such 
as, hIL5-receptor has been fused with Fc portions for the purpose of 
high-throughput screening assays to identify antagonists of hIL-5. See, D. 
Bennett et al.,J. Molec. Recognition 5:52-58 (1995) and K. Johanson e^ a/., J. 
Biol Chem. 270:9459-9471 (1995). 

In addition to the use of expression vectors in the practice of the present 
invention, the present invention further includes novel expression vectors 
comprising operator and promoter elements operatively linked to nucleotide 
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sequences encoding a protein of interest. One example of such a vector is pHE4a 
which is described in detail below. 

As summarized in FIG. 7 and FIG. 8, components of the pHE4a vector 
(SEQ ID NO: 45) include: 1) a neomycinphosphotransferase gene as a selection 
marker, 2) an E. coli origin of replication, 3) a T5 phage promoter sequence, 
4) two lac operator sequences, 5) a Shine-Delgarno sequence, 6) the lactose 
operon repressor gene (laclq) and 7) a multiple cloning site linker region. The 
origin of replication (oriC) is derived from pUC 1 9 (LTI, Gaithersburg, MD). The 
promoter sequence and operator sequences were made synthetically. Synthetic 
production of nucleic acid sequences is well known in the art. Clontech 95/96 
Catalog, pages 215-216, Clontech, 1020 East Meadow Circle, Palo Alto, CA 

94303. The pHE4a vector was deposited with the ATCC on , 1998, and 

given accession number . 

A nucleotide sequence encoding either ET2 (SEQ ID NO: 1) or GABRE 
(SEQ ID NO:4 1 ), is operatively linked to the promoter and operator of pHE4a by 
restricting the vector with Ndel and either Xbal, BamYLI, Xhol, or Aspl\ 8, and 
isolating the larger fragment (the multiple cloning site region is about 310 
nucleotides) on a gel. The nucleotide sequence encoding ET2 (SEQ ID NO:l) or 
GABRE (SEQ ID NO:4 1 ) having the appropriate restriction sites is generated, for 
example, according to the PCR protocol described in Example 1, using PCR 
primers having restriction sites for Ndel (as the 5' primer) and either Xba\ BamHl, 
Xho\ oxAspl\% (as the 3' primer). The PCR insert is gel purified and restricted 
with compatible enzymes. The insert and vector are ligated according to standard 
protocols. 

As noted above, the pHE4a vector contains a /aclq gene. Zaclq is an 
allele of the lacl gene which confers tight regulation of the lac operator. Amann, 
E. et al, Gene £9:301-315 (1988); Stark, M., Gene 57:255-267 (1987). The 
laclq gene encodes a repressor protein which binds to lac operator sequences and 
blocks transcription of down-stream (i.e., 3 r ) sequences. However, the laclq gene 
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product dissociates from the lac operator in the presence of either lactose or 
certain lactose analogs, e.g., isopropyl B-D-thiogalactopyranoside (IPTG). The 
ET2 or GABRE protein thus is not produced in appreciable quantities in 
uninduced host cells containing the pHE4a vector. Induction of these host cells 
by the addition of an agent such as IPTG, however, results in the expression of the 
ET2 or GABRE coding sequence. 

The promoter/operator sequences of the pHE4a vector (SEQ ID NO:46) 
comprise a T5 phage promoter and two lac operator sequences. One operator is 
located 5 1 to the transcriptional start site and the other is located 3' to the same 
site. These operators, when present in combination with the laclq gene product, 
confer tight repression of down-stream sequences in the absence of a lac operon 
inducer, e.g., IPTG. Expression of operatively linked sequences located 
down-stream from the lac operators may be induced by the addition of a lac 
operon inducer, such as IPTG. Binding of a lac inducer to the laclq proteins 
results in their release from the lac operator sequences and the initiation of 
transcription of operatively linked sequences. Lac operon regulation of gene 
expression is reviewed in Devlin, T., Textbook of Biochemistry with Clinical 
Correlations, 4th Edition (1997), pages 802-807. 

The pHE4 series of vectors contain all of the components of the pHE4a 
vector except for the ET2 and GABRE coding sequence. Features of the pHE4a 
vectors include optimized synthetic T5 phage promoter, lac operator, and Shine- 
Delgarno sequences. Further, these sequences are also optimally spaced so that 
expression of an inserted gene may be tightly regulated and high level of 
expression occurs upon induction. 

Among known bacterial promoters suitable for use in the production of 
proteins of the present invention include the E. coli lacl and lacL promoters, the 
T3 and T7 promoters, the gpt promoter, the lambda PR and PL promoters and the 
trp promoter. Suitable eukaryotic promoters include the CMV immediate early 
promoter, the HSV thymidine kinase promoter, the early and late SV40 
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promoters, the promoters of retroviral LTRs, such as those of the Rous Sarcoma 
Virus (RS V), and metallothionein promoters, such as the mouse metallothionein-I 
promoter. 

The pHE4a vector also contains a Shine-Delgarno sequence 5 ' to the AUG 
5 initiation codon. Shine-Delgarno sequences are short sequences generally located 

about 10 nucleotides up-stream (i.e., 5') from the AUG initiation codon. These 
sequences essentially direct prokaryotic ribosomes to the AUG initiation codon. 

Thus, the present invention is also directed to expression vector useful for 
the production of the proteins of the present invention. This aspect of the 
^10 invention is exemplified by the pHE4a vector (SEQ ID NO:45). 

0 ET2 and GABRE receptor subunits can be recovered and purified from 

yj recombinant cell cultures by well-known methods including ammonium sulfate or 

ethanol precipitation, acid extraction, anion or cation exchange chromatography, 
W phosphocellulose chromatography, hydrophobic interaction chromatography, 

5 & 

s 15 affinity chromatography, hydroxy lapatite chromatography and lectin 

Sf| chromatography. Most preferably, high performance liquid chromatography 

iV ? ("HPLC") is employed for purification. Polypeptides of the present invention 

%fj include naturally purified products, products of chemical synthetic procedures, and 

^ products produced by recombinant techniques from a prokaryotic or eukaryotic 

20 host, including, for example, bacterial, yeast, higher plant, insect and mammalian 

cells. Depending upon the host employed in a recombinant production procedure, 
the polypeptides of the present invention may be glycosylated or may be 
non-glycosylated. In addition, polypeptides of the invention may also include an 
initial modified methionine residue, in some cases as a result of host-mediated 
25 processes. 
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ET2 and GABRE Polypeptides and Fragments 

The invention further provides isolated ET2 polypeptides having the amino 
acid sequence in SEQ ID NO:2 and SEQ ID NO:42, as well as peptides or 
polypeptides comprising portions of the sequences shown in SEQ ID NO:2 and 
SEQ ID NO:42. The terms "peptide" and "oligopeptide" are considered 
synonymous (as is commonly recognized) and each term can be used 
interchangeably as the context requires to indicate a chain of at least two amino 
acids coupled by peptidyl linkages. The word "polypeptide" is used herein for 
chains containing more than ten amino acid residues. All oligopeptide and 
polypeptide formulas or sequences herein are written from left to right and in the 
direction from amino terminus to carboxy terminus. 

It will be recognized in the art that some amino acid sequences of the ET2 
and GABRE receptor subunits can be varied without significant effect on the 
structure or function of the protein. If such differences in sequence are 
contemplated, it should be remembered that there will be critical areas on the 
protein which determine activity. In general, it is possible to replace residues 
which form the tertiary structure, provided that residues performing a similar 
function are used. In other instances, the type of residue may be completely 
unimportant if the alteration occurs at a non-critical region of the protein. 

Thus, the invention further includes variations of the ET2 and GABRE 
receptor subunits which show substantial ET2 or GABRE receptor subunit 
activity or which include regions of the ET2 or GABRE protein such as the 
protein portions discussed below. Such mutants include deletions, insertions, 
inversions, repeats, and type substitutions (for example, substituting one 
hydrophilic residue for another, but not strongly hydrophilic for strongly 
hydrophobic as a rule). Small changes or such "neutral" amino acid substitutions 
will generally have little effect on activity. 
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Typically seen as conservative substitutions are the replacements, one for 
another, among the aliphatic amino acids Ala, Val, Leu and lie; interchange of the 
hydroxyl residues Ser and Thr, exchange of the acidic residues Asp and Glu, 
substitution between the amide residues Asn and Gin, exchange of the basic 
5 residues Lys and Arg and replacements among the aromatic residues Phe, Tyr. 

As indicated in detail above, further guidance concerning which amino acid 
changes are likely to be phenotypically silent {i.e., are not likely to have a 
significant deleterious effect on a function) can be found in Bowie, J.U., et al, 
"Deciphering the Message in Protein Sequences: Tolerance to Amino Acid 

10 Substitutions," Science 2^7.1306-1310 (1990). 

I 

i Thus, the fragment, derivative or analog of the polypeptide of SEQ ID 

I NO:2 or SEQ ID NO:42 may be (i) one in which one or more of the amino acid 

: residues are substituted with a conserved or non-conserved amino acid residue 

I 

j (preferably a conserved amino acid residue) and such substituted amino acid 



B 15 residue may or may not be one encoded by the genetic code, or (ii) one in which 

Jr! one or more of the amino acid residues includes a substituent group, or (iii) one 

Hi in which the polypeptide is fused with another compound, such as a compound to 

if increase the half-life of the polypeptide (for example, polyethylene glycol), or (iv) 

w one in which the additional amino acids are fused to the polypeptide, such as an 

20 IgG Fc fusion region peptide or leader or secretory sequence or a sequence which 



is employed for purification of the polypeptide or a proprotein sequence. Such 
fragments, derivatives and analogs are deemed to be within the scope of those 
skilled in the art from the teachings herein. 

Of particular interest are substitutions of charged amino acids with another 
25 charged amino acid and with neutral or negatively charged amino acids. The latter 

results in proteins with reduced positive charge to improve the characteristics of 
the ET2 or GABRE protein. The prevention of aggregation is highly desirable. 
Aggregation of proteins not only results in a loss of activity but can also be 
problematic when preparing pharmaceutical formulations, because they can be 
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immunogenic. (Pinckard et al, Clin Exp. Immunol. 2:331-340 (1967); Robbins 
et al, Diabetes 36:838-845 (1987); Cleland et al Crit Rev. Therapeutic Drug 
Carrier Systems 70:307-377 (1993)). 

The replacement of amino acids can also change the selectivity of binding 
to cell surface receptors. Ostade et al, Nature 361:266-268 (1993) describes 
certain mutations resulting in selective binding of TNF-a to only one of the two 
known types of TNF receptors. Thus, the ET2 and GABRE receptor subunits of 
the present invention may include one or more amino acid substitutions, deletions 
or additions, either from natural mutations or human manipulation. 

As indicated, changes are preferably of a minor nature, such as 
conservative amino acid substitutions that do not significantly affect the folding 
or activity of the protein (see Table 1). 



TABLE 1 . Conservative Amino Acid Substitutions. 



Aromatic 


Phenylalanine 




Tryptophan 




Tyrosine 


Hydrophobic 


Leucine 




Isoleucine 




Valine 


Polar 


Glutamine 




Asparagine 


Basic 


Arginine 




Lysine 




Histidine 


Acidic 


Aspartic Acid 




Glutamic Acid 


Small 


Alanine 




Serine 




Threonine 




Methionine 




Glycine 
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Of course, the number of amino acid substitutions a skilled artisan would 
make depends on many factors, including those described above and below. 
Generally speaking, the number of substitutions for any given ET2 or GABRE 
polypeptide, or mutant thereof, will not be more than 50, 40, 30, 20, 10, 5, or 3, 
depending on the objective. 

Amino acids in the ET2 or GABRE protein of the present invention that 
are essential for function can be identified by methods known in the art, such as 
site-directed mutagenesis or alanine-scanning mutagenesis (Cunningham and 
Wells, Science 244: 1081-1085 (1989)). The latter procedure introduces single 
alanine mutations at every residue in the molecule. The resulting mutant 
molecules are then tested for biological activity such as binding activity towards 
other members of the GABA A receptor complex. Sites that are critical for such 
binding can also be determined by structural analysis such as crystallization, 
nuclear magnetic resonance or photoaffinity labeling (Smith et al, J. Mol Biol 
224:899-904 (1992) and de Vos et al Science 255:306-312 (1992)). 

The polypeptides of the present invention are preferably provided in an 
isolated form. By "isolated polypeptide" is intended a polypeptide removed from 
its native environment. Thus, a polypeptide produced and/or contained within a 
recombinant host cell is considered isolated for purposes of the present invention. 
Also intended as an "isolated polypeptide" are polypeptides that have been 
purified, partially or substantially, from a recombinant host cell. For example, a 
recombinantly produced version of the ET2 or GABRE polypeptide can be 
substantially purified by the one-step method described in Smith and Johnson, 
Gene 67:31-40 (1988). 

The polypeptides of the present invention include a polypeptide comprising 
amino acids about 1 to about 242 in SEQ ID NO:2; a polypeptide comprising 
amino acids about 2 to about 242 in SEQ ID NO:2; a polypeptide comprising one 
or more of the ET2 receptor subunit transmembrane domains (amino acids from 
about 1 5 to about 36 in SEQ ID NO:2, amino acids from about 42 to about 62 in 
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SEQ ID NO:2, amino acids from about 75 to about 95 in SEQ ID NO:2, amino 
acids from about 219 to about 240 in SEQ ID NO:2); a polypeptide comprising 
the intracellular domain of the ET2 receptor subunit (amino acids from about 96 
to about 218 in SEQ ID NO:2); a polypeptide comprising the ET2 receptor 
subunit with all or part of one or more of the transmembrane domains deleted; as 
well as polypeptides at least 95% identical, more preferably at least 96%, 97%, 
98% or 99% identical to the polypeptide of SEQ ID NO:2, and also include 
portions of such polypeptides with at least 30 amino acids and more preferably at 
least 50 amino acids. 

The polypeptides of the present invention include a polypeptide comprising 
amino acids about -1 8 to about 488 in SEQ ID NO:42; a polypeptide comprising 
amino acids about - 1 7 to about 488 in SEQ ID NO:42; a polypeptide comprising 
amino acids about 1 to about 488 in SEQ ID NO:42; a polypeptide comprising 
one or more of the GABRE receptor subunit transmembrane domains (amino 
acids from about 26 1 to about 282 in SEQ ID NO:42, amino acids from about 288 
to about 308 in SEQ ID NO:42, amino acids from about 321 to about 341 in SEQ 
ID NO:42, amino acids from about 465 to about 486 in SEQ ID NO:42); a 
polypeptide comprising the extracellular domain of the GABRE receptor subunit 
(amino acids from about 1 to about 260 in SEQ ID NO:42); a polypeptide 
comprising the intracellular domain of the GABRE receptor subunit (amino acids 
from about 342 to about 464 in SEQ ID NO:42); a polypeptide comprising the 
GABRE receptor subunit with all or part of one or more of the transmembrane 
domains deleted; as well as polypeptides at least 95% identical, more preferably 
at least 96%, 97%, 98% or 99% identical to the polypeptide of SEQ ID NO:42, 
and also include portions of such polypeptides with at least 30 amino acids and 
more preferably at least 50 amino acids. 

By a polypeptide having an amino acid sequence at least, for example, 95% 
"identical" to a reference amino acid sequence of an ET2 or GABRE polypeptide 
is intended that the amino acid sequence of the polypeptide is identical to the 
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reference sequence except that the polypeptide sequence may include up to five 
amino acid alterations per each 1 00 amino acids of the reference amino acid of the 
ET2 or GABRE receptor subunit. In other words, to obtain a polypeptide having 
an amino acid sequence at least 95% identical to a reference amino acid sequence, 
up to 5% of the amino acid residues in the reference sequence may be deleted or 
substituted with another amino acid, or a number of amino acids up to 5% of the 
total amino acid residues in the reference sequence may be inserted into the 
reference sequence. These alterations of the reference sequence may occur at the 
amino or carboxy terminal positions of the reference amino acid sequence or 
anywhere between those terminal positions, interspersed either individually among 
residues in the reference sequence or in one or more contiguous groups within the 
reference sequence. 

As a practical matter, whether any particular polypeptide is at least 95%, 
96%, 97%, 98% or 99% identical to, for instance, the amino acid sequence shown 
in SEQ ID NO:2 or SEQ ID NO:42 can be determined conventionally using 
known computer programs such the Bestfit program (Wisconsin Sequence 
Analysis Package, Version 8 for Unix, Genetics Computer Group, University 
Research Park, 575 Science Drive, Madison, WI 5371 1). When using Bestfit or 
any other sequence alignment program to determine whether a particular sequence 
is, for instance, 95% identical to a reference sequence according to the present 
invention, the parameters are set, of course, such that the percentage of identity 
is calculated over the full length of the reference amino acid sequence and that 
gaps in homology of up to 5% of the total number of amino acid residues in the 
reference sequence are allowed. 

The polypeptide of the present invention could be used as a molecular 
weight marker on SDS-PAGE gels or on molecular sieve gel filtration columns 
using methods well known to those of skill in the art. 

As described in detail below, the polypeptides of the present invention can 
also be used to raise polyclonal and monoclonal antibodies, which are useful in 
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assays for detecting ET2 or GABRE protein expression as described below or as 
agonists and antagonists capable of enhancing or inhibiting ET2 or GABRE 
receptor subunit function. Further, such polypeptides can be used in the yeast 
two-hybrid system to "capture" ET2 or GABRE receptor subunit binding proteins 
which are also candidate agonist and antagonist according to the present 
invention. The yeast two hybrid system is described in Fields and Song, Nature 
340:245-246 (1989). 

In another aspect, the invention provides a peptide or polypeptide 
comprising an epitope-bearing portion of a polypeptide of the invention. The 
epitope of this polypeptide portion is an immunogenic or antigenic epitope of a 
polypeptide of the invention. An "immunogenic epitope" is defined as a part of 
a protein that elicits an antibody response when the whole protein is the 
immunogen. These immunogenic epitopes are believed to be confined to a few 
loci on the molecule. On the other hand, a region of a protein molecule to which 
an antibody can bind is defined as an "antigenic epitope." The number of 
immunogenic epitopes of a protein generally is less than the number of antigenic 
epitopes. See, for instance, Geysen et al, Proc. Natl Acad. Set USA 
57:3998-4002(1983). 

As to the selection of peptides or polypeptides bearing an antigenic epitope 
(i e. , that contain a region of a protein molecule to which an antibody can bind), 
it is well known in that art that relatively short synthetic peptides that mimic part 
of a protein sequence are routinely capable of eliciting an antiserum that reacts 
with the partially mimicked protein. See, for instance, Sutcliffe, J. G. et al, 
Antibodies that React with Predetermined Sites on Proteins, Science 219:660-666 
(1983). Peptides capable of eliciting protein-reactive sera are frequently 
represented in the primary sequence of a protein, can be characterized by a set of 
simple chemical rules, and are confined neither to immunodominant regions of 
intact proteins (z.e., immunogenic epitopes) nor to the amino or carboxyl 
terminals. Peptides that are extremely hydrophobic and those of six or fewer 
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residues generally are ineffective at inducing antibodies that bind to the mimicked 
protein; longer, peptides, especially those containing proline residues, usually are 
effective. Sutcliffe et al , supra, at 66 1 . For instance, 1 8 of 20 peptides designed 
according to these guidelines, containing 8-39 residues covering 75% of the 
sequence of the influenza virus hemagglutinin HA1 polypeptide chain, induced 
antibodies that reacted with the HA1 protein or intact virus; and 12/12 peptides 
from the MuLV polymerase and 18/18 from the rabies glycoprotein induced 
antibodies that precipitated the respective proteins. 

Antigenic epitope-bearing peptides and polypeptides of the invention are 
therefore useful to raise antibodies, including monoclonal antibodies, that bind 
specifically to a polypeptide of the invention. Thus, a high proportion of 
hybridomas obtained by fusion of spleen cells from donors immunized with an 
antigen epitope-bearing peptide generally secrete antibody reactive with the native 
protein. Sutcliffe et al. 9 supra, at 663. The antibodies raised by antigenic 
epitope-bearing peptides or polypeptides are useful to detect the mimicked 
protein, and antibodies to different peptides may be used for tracking the fate of 
various regions of a protein precursor which undergoes post-translational 
processing. The peptides and anti-peptide antibodies may be used in a variety of 
qualitative or quantitative assays for the mimicked protein, for instance in 
competition assays since it has been shown that even short peptides (e.g., about 
9 amino acids) can bind and displace the larger peptides in immunoprecipitation 
assays. See, for instance, Wilson etf al, Cell 37:767-778 (1984) at 777. The anti- 
peptide antibodies of the invention also are useful for purification of the mimicked 
protein, for instance, by adsorption chromatography using methods well known 
in the art. 

Antigenic epitope-bearing peptides and polypeptides of the invention 
designed according to the above guidelines preferably contain a sequence of at 
least seven, more preferably at least nine and most preferably between about 1 5 
to about 30 amino acids contained within the amino acid sequence of a 
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polypeptide of the invention. However, peptides or polypeptides comprising a 
larger portion of an amino acid sequence of a polypeptide of the invention, 
containing about 30 to about 50 amino acids, or any length up to and including the 
entire amino acid sequence of a polypeptide of the invention, also are considered 
epitope-bearing peptides or polypeptides of the invention and also are useful for 
inducing antibodies that react with the mimicked protein. Preferably, the amino 
acid sequence of the epitope-bearing peptide is selected to provide substantial 
solubility in aqueous solvents (z.e., the sequence includes relatively hydrophilic 
residues and highly hydrophobic sequences are preferably avoided); and sequences 
containing proline residues are particularly preferred. 

The epitope-bearing peptides and polypeptides of the invention may be 
produced by any conventional means for making peptides or polypeptides 
including recombinant means using nucleic acid molecules of the invention. For 
instance, a short epitope-bearing amino acid sequence may be fused to a larger 
polypeptide which acts as a carrier during recombinant production and 
purification, as well as during immunization to produce anti-peptide antibodies. 
Epitope-bearing peptides also may be synthesized using known methods of 
chemical synthesis. For instance, Houghten has described a simple method for 
synthesis of large numbers of peptides, such as 10-20 mg of 248 different 13 
residue peptides representing single amino acid variants of a segment of the HA1 
polypeptide which were prepared and characterized (by ELISA-type binding 
studies) in less than four weeks. Houghten, R. A., General Method for the Rapid 
Solid-Phase Synthesis of Large Numbers of Peptides: Specificity of 
Antigen- Antibody Interaction at the Level of Individual Amino Acids, Proc. Natl 
Acad. Set USA 52:5131-5135 (1985). This "Simultaneous Multiple Peptide 
Synthesis (SMPS)" process is further described in U.S. Patent No. 4,631,21 1 to 
Houghten et ah (1986). In this procedure the individual resins for the solid-phase 
synthesis of various peptides are contained in separate solvent-permeable packets, 
enabling the optimal use of the many identical repetitive steps involved in 
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solid-phase methods. A completely manual procedure allows 500-1000 or more 
syntheses to be conducted simultaneously. Houghten et al 9 supra, at 5134. 

Epitope-bearing peptides and polypeptides of the invention are used to 
induce antibodies according to methods well known in the art. See, for instance, 
5 Sutcliffe et aL 9 supra; Wilson et al , supra; Chow, M. et al, Proc. Natl Acad Scl 

USA 52:910-914; and Bittle, F. J. et al, J. Gen. Virol 56:2347-2354 (1985). 
Generally, animals may be immunized with free peptide; however, anti-peptide 
antibody titer may be boosted by coupling of the peptide to a macromolecular 
carrier, such as keyhole limpet hemacyanin (KLH) or tetanus toxoid. For instance, 
^10 peptides containing cysteine may be coupled to carrier using a linker such as 

# m-maleimidobenzoyl-N-hydroxysiiccinimide ester (MBS), while other peptides 

C3 

may be coupled to carrier using a more general linking agent such as 

CI 

^ glutaraldehy de. Animals such as rabbits, rats and mice are immunized with either 

W free or carrier-coupled peptides, for instance, by intraperitoneal and/or intradermal 

s 15 injection of emulsions containing about 100 (ig peptide or carrier protein and 

Sfj Freund's adjuvant. Several booster injections may be needed, for instance, at 

*y intervals of about two weeks, to provide a useful titer of anti-peptide antibody 

0 which can be detected, for example, by ELIS A assay using free peptide adsorbed 

ffl 

to a solid surface. The titer of anti-peptide antibodies in serum from an immunized 
20 animal may be increased by selection of anti-peptide antibodies, for instance, by 

adsorption to the peptide on a solid support and elution of the selected antibodies 
according to methods well known in the art. 

Immunogenic epitope-bearing peptides of the invention, /.e,, those parts 
of a protein that elicit an antibody response when the whole protein is the 
25 immunogen, are identified according to methods known in the art. For instance, 

Gey sen et al , supra, discloses a procedure for rapid concurrent synthesis on solid 
supports of hundreds of peptides of sufficient purity to react in an enzyme-linked 
immunosorbent assay. Interaction of synthesized peptides with antibodies is then 
easily detected without removing them from the support. In this manner a peptide 
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bearing an immunogenic epitope of a desired protein may be identified routinely 
by one of ordinary skill in the art. For instance, the immunologically important 
epitope in the coat protein of foot-and-mouth disease virus was located by Gey sen 
et al. supra with a resolution of seven amino acids by synthesis of an overlapping 
set of all 208 possible hexapeptides covering the entire 213 amino acid sequence 
of the protein. Then, a complete replacement set of peptides in which all 20 amino 
acids were substituted in turn at every position within the epitope were 
synthesized, and the particular amino acids conferring specificity for the reaction 
with antibody were determined. Thus, peptide analogs of the epitope-bearing 
peptides of the invention can be made routinely by this method. U.S. Patent No. 
4,708,78 1 to Geysen (1987) further describes this method of identifying apeptide 
bearing an immunogenic epitope of a desired protein. 

Further still, U.S. Patent No. 5,194,392 to Geysen (1990) describes a 
general method of detecting or determining the sequence of monomers (amino 
acids or other compounds) which is a topological equivalent of the epitope (z.e., 
a "mimotope") which is complementary to a particular paratope (antigen binding 
site) of an antibody of interest. More generally, U.S. Patent No. 4,433,092 to 
Geysen (1989) describes a method of detecting or determining a sequence of 
monomers which is a topographical equivalent of a ligand which is complementary 
to the ligand binding site of a particular receptor of interest. Similarly, U.S. Patent 
No. 5,480,971 to Houghten, R. A. et al (1996) on Peralkylated Oligopeptide 
Mixtures discloses linear C r C 7 -alkyl peralkylated oligopeptides and sets and 
libraries of such peptides, as well as methods for using such oligopeptide sets and 
libraries for determining the sequence of a peralkylated oligopeptide that 
preferentially binds to an acceptor molecule of interest. Thus, non-peptide analogs 
of the epitope-bearing peptides of the invention also can be made routinely by 
these methods. 

The entire disclosure of each document cited in this section on "ET2 and 
GABRE Polypeptides and Fragments" is hereby incorporated herein by reference. 
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As one of skill in the art will appreciate, ET2 or GABRE polypeptides of 
the present invention and the epitope-bearing fragments thereof described above 
can be combined with parts of the constant domain of immunoglobulins (IgG), 
resulting in chimeric polypeptides. These fusion proteins facilitate purification and 
show an increased half-life in vivo. This has been shown, e.g., for chimeric 
proteins consisting of the first two domains of the human CD4-polypeptide and 
various domains of the constant regions of the heavy or light chains of mammalian 
immunoglobulins (EPA 394,827; Traunecker et al, Nature 357:84-86 (1988)). 
Fusion proteins that have a disulfide-linked dimeric structure due to the IgG part 
can also be more efficient in binding and neutralizing other molecules than the 
monomeric ET2 or GABRE protein or protein fragments alone (Fountoulakis et 
aL, J. Biochem. 270:3958-3964 (1995)). 

Disease Diagnosis and Prognosis 

It is believed that tissues in mammals with certain disease states express 
significantly altered levels of the ET2 and/or GABRE receptor subunit and mRN A 
encoding the ET2 and/or GABRE receptor subunit when compared to a 
corresponding "standard" mammal, i.e. , a mammal of the same species not having 
the disease state. Further, it is believed that altered levels of the ET2 and/or 
GABRE receptor subunit can be detected in certain body fluids {e.g. , sera, plasma, 
urine, and spinal fluid) from mammals with these disease states when compared 
to sera from mammals of the same species not having the disease. Thus, the 
invention provides a diagnostic method useful during diagnosis, which involves 
assaying the expression level of the gene encoding the ET2 and GABRE receptor 
subunits in mammalian cells or body fluid and comparing the gene expression level 
with a standard expression level, whereby a significantly altered gene expression 
level in comparison to the standard is indicative of certain diseases. One variation 
of this method involves assaying for the level of mRNA molecules encoding either 



-45- 



ET2 or GABRE polypeptides and comparing these mRNA level with standard 
ET2 or GABRE mRNA levels. Another variation involves assaying for the level 
of mRNA molecules encoding both ET2 and GABRE polypeptides and comparing 
these mRNA level with each other to determine a ratio of ET2 to GABRE 
expression and comparing this ratio to a standard ET2 to GABRE ratio. A further 
variation involves assaying for the level of mRNA molecules encoding both ET2 
and GABRE polypeptides to determine the total amount of ET2 and GABRE 
mRNA molecules and comparing this total amount to a standard total amount. 

Where the diagnosis of a disease state has already been made according to 
conventional methods, the present invention is useful as a prognostic indicator, 
whereby patients exhibiting significantly altered ET2 and/or GABRE receptor 
subunit expression will experience a worse clinical outcome relative to patients 
expressing the gene encoding these proteins at a level which deviates less from 
that of the standard. 

By "assaying the expression level of the gene encoding the ET2 and 
GABRE receptor subunits" is intended qualitatively or quantitatively measuring 
or estimating the level of the ET2 and/or GABRE protein or the level of the 
mRNA encoding the ET2 and/or GABRE receptor subunit in a first biological 
sample either directly (e.g. , by determining or estimating absolute protein level or 
mRNA level) or relatively (e.g. , by comparing to the ET2 and/or GABRE protein 
level or mRNA level in a second biological sample). 

Preferably, the ET2 and/or GABRE protein level or mRNA level in the 
first biological sample is measured or estimated and compared to a standard ET2 
and/or GABRE protein level or mRNA level, the standard being taken from a 
second biological sample obtained from an individual not having the disease state. 
As will be appreciated in the art, once standard ET2 and GABRE protein levels 
or mRNA levels are known, they can be used repeatedly as a standard for 
comparison. 
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By "biological sample" is intended any biological sample obtained from an 
individual, cell line, tissue culture, or other source which contains ET2 and/or 
GABRE protein or mRNA. Biological samples include mammalian body fluids 
(e.g., sera, plasma, urine, synovial fluid and spinal fluid) and tissues (e.g., 
testicular, heart, prostate, placenta, small intestine and colon tissues) which 
contain ET2 and/or GABRE protein or mRNA. Methods for obtaining tissue 
biopsies and body fluids from mammals are well known in the art. Where the 
biological sample is to include mRNA, a tissue biopsy is the preferred source. 

The present invention is useful for detecting diseases in mammals. In 
particular the invention is believed to be useful in diagnosing disease states in 
mammals, including schizophrenia, Huntington's Chorea, alcoholism, muscle 
spasms and rigidity, sleep disorders, seizure disorders (e.g., epilepsy), anxiety 
related disorders, AIDS related dementia, Alzheimer's disease, and Parkinson's 
disease. Preferred mammals include monkeys, apes, cats, dogs, cows, pigs, 
horses, rabbits and humans. Particularly preferred mammals are humans. 

Total cellular RNA can be isolated from a biological sample using any 
suitable technique such as the single-step guanidinium-thiocyanate-phenol- 
chloroform method described in Chomczynski and Sacchi, Anal Biochem. 
7(52:156-159 (1987). Levels of mRNA encoding the ET2 and/or GABRE 
receptor subunit are then assayed using any appropriate method. These include 
Northern blot analysis, SI nuclease mapping, the polymerase chain reaction 
(PCR), reverse transcription in combination with the polymerase chain reaction 
(RT-PCR), and reverse transcription in combination with the ligase chain reaction 
(RT-LCR). 

Northern blot analysis can be performed as described in Harada et al. } Cell 
63:303-3 12 (1990). Briefly, total RNA is prepared from a biological sample as 
described above. For the Northern blot, the RNA is denatured in an appropriate 
buffer (such as glyoxal/dimethyl sulfoxide/sodium phosphate buffer), subjected to 
agarose gel electrophoresis, and transferred onto a nitrocellulose filter. After the 
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RNAs have been linked to the filter by a UV linker, the filter is prehybridized in 
a solution containing formamide, SSC, Denhardt's solution, denatured salmon 
sperm, SDS, and sodium phosphate buffer. ET2 and/or GABRE receptor subunit 
cDNA labeled according to any appropriate method (such as the 32 P-multiprimed 
DNA labeling system (Amersham)) is used as probe. After hybridization 
overnight, the filter is washed and exposed to x-ray film. cDNA for use as probe 
according to the present invention is described in the sections above and will 
preferably at least 15 bp in length. 

SI mapping can be performed as described in Fujita et al, Cell 49:357- 
367 (1987). To prepare probe DNA for use in SI mapping, the sense strand of 
above-described cDNA is used as a template to synthesize labeled antisense DNA. 
The antisense DNA can then be digested using an appropriate restriction 
endonuclease to generate further DNA probes of a desired length. Such antisense 
probes are useful for visualizing protected bands corresponding to the target 
mRNA (i.e., mRNA encoding the ET2 and/or GABRE receptor subunit). 
Northern blot analysis can be performed as described above. 

Preferably, levels of mRNA encoding the ET2 and/or GABRE receptor 
subunit are assayed using the RT-PCR method described in Makino et al s 
Technique 2:295-301 (1990). By this method, the radioactivities of the 
"amplicons" in the polyacrylamide gel bands are linearly related to the initial 
concentration of the target mRNA. Briefly, this method involves adding total 
RNA isolated from a biological sample in a reaction mixture containing a RT 
primer and appropriate buffer. After incubating for primer annealing, the mixture 
can be supplemented with a RT buffer, dNTPs, DTT, RNase inhibitor and reverse 
transcriptase. After incubation to achieve reverse transcription of the RNA, the 
RT products are then subject to PCR using labeled primers. Alternatively, rather 
than labeling the primers, a labeled dNTP can be included in the PCR reaction 
mixture. PCR amplification can be performed in a DNA thermal cycler according 
to conventional techniques. After a suitable number of rounds to achieve 
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amplification, the PCR reaction mixture is electrophoresed on a polyacrylamide 
gel. After drying the gel, the radioactivity of the appropriate bands 
(corresponding to the mRNA encoding the ET2 and/or GABRE receptor subunit) 
is quantified using an imaging analyzer. RT and PCR reaction ingredients and 
conditions, reagent and gel concentrations, and labeling methods are well known 
in the art. Variations on the RT-PCR method will be apparent to the skilled 
artisan. 

Any set of oligonucleotide primers which will amplify reverse transcribed 
target mRNA can be used and can be designed as described in the sections above. 

Assaying ET2 and/or GABRE protein levels in a biological sample can 
occur using any art-known method. Preferred for assaying ET2 and/or GABRE 
protein levels in a biological sample are antibody-based techniques. For example, 
ET2 and/or GABRE protein expression in tissues can be studied with classical 
immunohistological methods. In these, the specific recognition is provided by the 
primary antibody (polyclonal or monoclonal) but the secondary detection system 
can utilize fluorescent, enzyme, or other conjugated secondary antibodies. As a 
result, an immunohistological staining of tissue section for pathological 
examination is obtained. Tissues can also be extracted, e.g. , with urea and neutral 
detergent, for the liberation of ET2 and/or GABRE protein for Western-blot or 
dot/slot assay (Jalkanen, ML, et al, J. Cell Biol 707:976-985 (1985); Jalkanen, 
M., etal,J. Cell. Biol 705:3087-3096(1987)). In this technique, which is based 
on the use of cationic solid phases, quantitation of ET2 and/or GABRE protein 
can be accomplished using isolated ET2 and/or GABRE protein as a standard. 
This technique can also be applied to body fluids. With these samples, molar 
concentrations of ET2 and GABRE proteins will aid to set standard values of ET2 
and GABRE protein content for different body fluids, like serum, plasma, urine, 
spinal fluid, etc. The normal appearance of ET2 and GABRE protein amounts can 
then be set using values from healthy individuals, which can be compared to those 
obtained from a test subject. 
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Other antibody-based methods useful for detecting ET2 and GABRE 
receptor subunit expression include immunoassays, such as the enzyme linked 
immunosorbent assay (ELISA) and the radioimmunoassay (RIA). For example, 
monoclonal antibodies with specificity for an ET2 or GABRE receptor subunit can 
be used both as an immunoabsorbent and as an enzyme-labeled probe to detect 
and quantify the ET2 or GABRE protein. The amount of ET2 or GABRE protein 
present in the sample can be calculated by reference to the amount present in a 
standard preparation using a linear regression computer algorithm. Such an 
ELISA for detecting a tumor antigen is described in Iacobelli et al, Breast Cancer 
Research and Treatment 77:19-30 (1 988). In another ELISA assay, two distinct 
specific monoclonal antibodies can be used to detect either ET2 or GABRE 
protein in a body fluid. In this assay, one of the antibodies is used as the 
immunoabsorbent and the other as the enzyme-labeled probe. 

The above techniques may be conducted essentially as a "one-step" or 
"two-step" assay. The "one-step" assay involves contacting ET2 and/or GABRE 
protein with immobilized antibody and, without washing, contacting the mixture 
with the labeled antibody. The "two-step" assay involves washing before 
contacting the mixture with the labeled antibody. Other conventional methods 
may also be employed as suitable. It is usually desirable to immobilize one 
component of the assay system on a support, thereby allowing other components 
of the system to be brought into contact with the component and readily removed 
from the sample. 

Suitable enzyme labels include, for example, those from the oxidase group, 
which catalyze the production of hydrogen peroxide by reacting with substrate. 
Glucose oxidase is particularly preferred as it has good stability and its substrate 
(glucose) is readily available. Activity of an oxidase label may be assayed by 
measuring the concentration of hydrogen peroxide formed by the enzyme-labeled 
antibody/substrate reaction. Besides enzymes, other suitable labels include 
radioisotopes, such as iodine ( 125 1, 121 I), carbon ( 14 C), sulphur ( 35 S), tritium ( 3 H), 
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indium ( 112 In), and technetium ( 99m Tc), and fluorescent labels, such as fluorescein 
and rhodamine, and biotin. 

In addition to assaying ET2 and/or GABRE protein levels in a biological 
sample obtained from an individual, ET2 and GABRE proteins can also be 
detected in vivo by imaging. Antibody labels or markers for in vivo imaging of 
ET2 and/or GABRE protein include those detectable by X-radiography , NMR or 
ESR. For X-radiography , suitable labels include radioisotopes such as barium or 
cesium, which emit detectable radiation but are not overtly harmful to the subject. 
Suitable markers for NMR and ESR include those with a detectable characteristic 
spin, such as deuterium, which may be incorporated into the antibody by labeling 
of nutrients for the relevant hybridoma. 

An ET2 and/or GABRE receptor subunit-specific antibody or antibody 
fragment which has been labeled with an appropriate detectable imaging moiety, 
such as a radioisotope (for example, 131 I, m In, 99m Tc), a radio-opaque substance, 
or a material detectable by nuclear magnetic resonance, is introduced (for 
example, parenterally , subcutaneously or intraperitoneally) into the mammal to be 
examined for cancer. It will be understood in the art that the size of the subject 
and the imaging system used will determine the quantity of imaging moiety needed 
to produce diagnostic images. In the case of a radioisotope moiety, for a human 
subject, the quantity of radioactivity injected will normally range from about 5 to 
20 millicuries of 99m Tc. The labeled antibody or antibody fragment will then 
preferentially accumulate at the location of cells which contain ET2 and/or 
GABRE protein. In vivo tumor imaging is described in S.W. Burchiel et ah, 
"Immunopharmacokinetics of Radiolabeled Antibodies and Their Fragments" 
(Chapter 13 in Tumor Imaging: The Radiochemical Detection of Cancer^ S.W. 
Burchiel and B.A. Rhodes, eds., Masson Publishing Inc. (1982)). 

ET2 and GABRE receptor subunit-specific antibodies for use in the 
present invention can be raised against the intact ET2 or GABRE receptor subunit 
or an antigenic polypeptide fragment thereof, which may presented together with 
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a carrier protein, such as an albumin, to an animal system (such as rabbit or 
mouse) or 5 if it is long enough (at least about 25 amino acids), without a carrier. 

As used herein, the term "antibody" (Ab) or "monoclonal antibody" (MAb) 
is meant to include intact molecules as well as antibody fragments (such as, for 
example, Fab and F(ab r ) 2 fragments) which are capable of specifically binding to 
the ET2 and/or GABRE receptor subunit. Fab and F(ab') 2 fragments lack the Fc 
fragment of intact antibody, clear more rapidly from the circulation, and may have 
less non-specific tissue binding of an intact antibody (Wahl et ah, J. Nuch Med. 
24:3 16-325 (1983)). Thus, these fragments are preferred. 

The antibodies of the present invention may be prepared by any of a variety 
of methods. For example, cells expressing the ET2 and/or GABRE receptor 
subunit, or an antigenic fragment thereof, can be administered to an animal in 
order to induce the production of sera containing polyclonal antibodies. In a 
preferred method, a preparation of either ET2 or GABRE protein is prepared and 
purified to render it substantially free of natural contaminants. Such a preparation 
is then introduced into an animal in order to produce polyclonal antisera of greater 
specific activity. 

In the most preferred method, the antibodies of the present invention are 
monoclonal antibodies (or ET2 and/or GABRE receptor subunit binding 
fragments thereof). Such monoclonal antibodies can be prepared using hybridoma 
technology (Kohler etal, Nature 256:495 (1975); Kohler etal } Eur. J. Immunol 
6:5l \ (1976); Kohler et ah, Eur. J. Immunol. 6:292 (1976); Hammerling etal, 
In: Monoclonal Antibodies and T-Cell Hybridomas, Elsevier, N.Y., (1981) pp. 
563-68 1). In general, such procedures involve immunizing an animal (preferably 
a mouse) with an ET2 or GABRE receptor subunit antigen or, more preferably, 
with an ET2 or GABRE receptor subunit-expressing cell. Suitable cells can be 
recognized by their capacity to bind anti-ET2 or anti-GABRE receptor subunit 
antibody. Such cells may be cultured in any suitable tissue culture medium; 
however, it is preferable to culture cells in Earle's modified Eagle ! s medium 
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supplemented with 10% fetal bovine serum (inactivated at about 56°C), and 
supplemented with about 1 0 g/1 of nonessential amino acids, about 1 ,000 U/ml of 
penicillin, and about 1 00 (ig/ml of streptomycin. The splenocytes of such mice are 
extracted and fused with a suitable myeloma cell line. Any suitable myeloma cell 
line may be employed in accordance with the present invention; however, it is 
preferable to employ the parent myeloma cell line (SP 2 0), available from the 
American Type Culture Collection, Rockville, Maryland. After fusion, the 
resulting hybridoma cells are selectively maintained in HAT medium, and then 
cloned by limiting dilution as described by Wands et al {Gastroenterology 
£0:225-232 (1 98 1)). The hybridoma cells obtained through such a selection are 
then assayed to identify clones which secrete antibodies capable of binding to an 
ET2 and/or GABRE receptor subunit antigen. 

Alternatively, additional antibodies capable of binding to an ET2 and/or 
GABRE receptor subunit antigen may be produced in a two-step procedure 
through the use of anti-idiotypic antibodies. Such a method makes use of the fact 
that antibodies are themselves antigens, and that, therefore, it is possible to obtain 
an antibody which binds to a second antibody. In accordance with this method, 
ET2 and/or GABRE receptor subunit-specific antibodies are used to immunize an 
animal, preferably a mouse. The splenocytes of such an animal are then used to 
produce hybridoma cells, and the hybridoma cells are screened to identify clones 
which produce an antibody whose ability to bind to the ET2 and/or GABRE 
receptor subunit-specific antibody can be blocked by the ET2 and/or GABRE 
receptor subunit antigen. Such antibodies comprise anti-idiotypic antibodies to the 
ET2 and/or GABRE receptor subunit-specific antibody and can be used to 
immunize an animal to induce formation of further ET2 and/or GABRE receptor 
subunit-specific antibodies. 

It will be appreciated that Fab and F(ab') 2 and other fragments of the 
antibodies of the present invention may be used according to the methods 
disclosed herein. Such fragments are typically produced by proteolytic cleavage, 
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using enzymes such as papain (to produce Fab fragments) or pepsin (to produce 
F(ab') 2 fragments). Alternatively, ET2 and/or GABRE receptor subunit-binding 
fragments can be produced through the application of recombinant DNA 
technology or through synthetic chemistry. 

Where in vivo imaging is used to detect enhanced levels of ET2 and/or 
GABRE protein in humans, it may be preferable to use "humanized" chimeric 
monoclonal antibodies. Such antibodies can be produced using genetic constructs 
derived from hybridoma cells producing the monoclonal antibodies described 
above. Methods for producing chimeric antibodies are known in the art. See, for 
review, Morrison, Science 229:1202 (1985); Oi et al, BioTechniques 4:214 
(1986); Cabilly et al, U.S. Patent No. 4,816,567; Taniguchi etal, EP 171496; 
Morrison etal, EP 173494;Neuberger^a/., WO 8601533; Robinson etal, WO 
8702671; Boulianne et al, Nature 312:643 (1984); Neuberger et al, Nature 
574:268(1985). 

Further suitable labels for the ET2 and/or GABRE receptor 
subunit-specific antibodies of the present invention are provided below. Examples 
of suitable enzyme labels include malate dehydrogenase, staphylococcal nuclease, 
delta-5-steroid isomerase, yeast-alcohol dehydrogenase, alpha-glycerol phosphate 
dehydrogenase, triose phosphate isomerase, peroxidase, alkaline phosphatase, 
asparaginase, glucose oxidase, beta-galactosidase, ribonuclease, urease, catalase, 
glucose-6-phosphate dehydrogenase, glucoamylase, and acetylcholine esterase. 

Examples of suitable radioisotopic labels include 3 H, m In, 125 I, 131 I, 32 P, 
35 S, 14 C, 51 Cr, 57 To, 58 Co, 59 Fe, 75 Se, 152 Eu, 90 Y, 67 Cu, 217 Ci, 211 At, 212 Pb, 47 Sc, 109 Pd, 
etc. m In is a preferred isotope where in vivo imaging is used since its avoids the 
problem of dehalogenation of the 125 I or I31 I-labeled monoclonal antibody by the 
liver. In addition, this radionucleotide has a more favorable gamma emission 
energy for imaging (Perkins et al, Eur. J. Nucl Med 70:296-301 (1985); 
Carasquillo^a/., J. Nucl Med. 25:281-287 (1987)). For example, 1H In coupled 
to monoclonal antibodies with 1 -(P-isothiocyanatobenzyl)-DPTA has shown little 
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uptake in non-tumorous tissues, particularly the liver, and therefore enhances 
specificity of tumor localization (Esteban et al, J. Nucl Med. 25:861-870 
(1987)). 

Examples of suitable non-radioactive isotopic labels include 157 Gd, 55 Mn, 
162 Dy 5 52 Tr, and 56 Fe. 

Examples of suitable fluorescent labels include an 152 Eu label, a fluorescein 
label, an isothiocyanate label, a rhodamine label, a phycoerythrin label, a 
phycocyanin label, an allophycocyanin label, an o-phthaldehyde label, and a 
fluorescamine label. 

Examples of suitable toxin labels include diphtheria toxin, ricin, and 
cholera toxin. 

Examples of chemiluminescent labels include a luminal label, an isoluminal 
label, an aromatic acridinium ester label, an imidazole label, an acridinium salt 
label, an oxalate ester label, a luciferin label, a luciferase label, and an aequorin 
label. 

Examples of nuclear magnetic resonance contrasting agents include heavy 
metal nuclei such as Gd, Mn, and iron. 

Typical techniques for binding the above-described labels to antibodies are 
provided by Kennedy et al, Clin. Chim. Acta 70:1-31 (1976), and Schurs et al, 
Clin. Chim. Acta 81 : 1 -40 (1 977). Coupling techniques mentioned in the latter are 
the glutaraldehyde method, the periodate method, the dimaleimide method, the 
m-maleimidobenzyl-N-hydroxy-succinimide ester method, all of which methods 
are incorporated by reference herein. 

Agonists and Antagonists o/ET2 and GABRE Receptor Subunit Activity 

Variations in the subunit composition of the GAB A A receptor complex can 
alter ligand induced responses (reviewed in Whiting, P. etal f Int. Rev. Neurobiol 
J£:95-138 (1995)). The presence of a y subunit in a GABA A receptor complex, 
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for example, confers sensitivity to benzodiazepines upon the complex while only 
marginally reducing the complex's affinity for GABA. Further, the presence of a 
GABRE subunit inhibits the ability of anesthetic agents to potentiate GAB A-gated 
chloride currents {see Example 6). In a similar fashion, the integration of the ET2 
receptor subunit is believed to modulate GABA A receptor complex activity by 
altering the binding affinity of this complex to various ligands. The ET2 and 
GABRE receptor subunits are further believed to enhance the binding affinity of 
GABA A receptor complex to ligands which do not generally bind the complex 
with high affinity. 

In one aspect, the present invention is directed to a method for inhibiting 
an activity of the ET2 and/or GABRE receptor subunit {e.g., binding of the ET2 
and/or GABRE receptor subunit to members of the GABA A receptor complex, 
modulation of GABA A receptor complex activities), which involves administering 
to a cell which expresses an ET2 and/or GABRE polypeptide an effective amount 
of an antagonist of ET2 and/or GABRE receptor subunit activity. Antagonists can 
include soluble forms of the ET2 or GABRE receptor subunit, antibodies directed 
against the ET2 or GABRE polypeptide, and antisense molecules. 

In a further aspect, the present invention is directed to a method for 
increasing ET2 and/or GABRE receptor subunit activity, which involves 
administering to a cell which expresses an ET2 and/or GABRE polypeptide an 
effective amount of an agonist. Agonists can include soluble forms of the ET2 or 
GABRE receptor subunit. 

By "agonist" is intended naturally occurring and synthetic compounds 
capable of enhancing one or more activity of the ET2 and/or GABRE receptor 
subunit. Such agonists include modified forms of the ET2 or GABRE polypeptide 
and agents which increase expression of ET2 and/or GABRE receptor subunits. 
By "agonistic activity" is intended the enhancement of one or more ET2 and/or 
GABRE receptor subunit activities. By "antagonist" is intended naturally 
occurring and synthetic compounds capable of inhibiting one or more activity of 
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the ET2 and/or GABRE receptor subunit Such antagonists include modified 
forms of the ET2 or GABRE polypeptide and agents which decrease expression 
of ET2 and/or GABRE receptor subunits. By "antagonistic activity" is intended 
the inhibition of one or more ET2 and/or GABRE receptor subunit activities. 
Whether any candidate "agonist" or "antagonist" of the present invention can 
enhance or inhibit ET2 and/or GABRE receptor subunit activity can be determined 
using art-known assays, including those described in more detail below. 

Thus, in a further aspect, a screening method is provided for determining 
whether a candidate compound is capable of enhancing or inhibiting ET2 and/or 
GABRE receptor subunit activity. The method involves contacting cells which 
express (1) ET2 and/or GABRE receptor subunit polypeptides and (2) other 
components of the GABA A receptor complex with (1) a molecule known to elicit 
a GABA A receptor complex induced cellular response (e.g., a GABA A receptor 
complex ligand), referred to herein as a "stimulatory molecule," and (2) a 
candidate compound, assaying a cellular response, and comparing the cellular 
response to a first standard cellular response, the first standard cellular response 
being assayed when contact is made with the stimulatory molecule in absence of 
the candidate compound, whereby an increased cellular response over the first 
standard indicates that the candidate compound is a potential agonist of ET2 
and/or GABRE receptor subunit activity and a decreased cellular response 
compared to the first standard indicates that the candidate compound is a potential 
antagonist of ET2 and/or GABRE receptor subunit activity. A final determination 
of whether the candidate compound is an agonist or antagonist of ET2 and/or 
GABRE receptor subunit activity is made by comparing the effect this compound 
has on GABA A receptor complex activity to a second standard cellular response, 
the second standard cellular response being assayed when cells are contacted with 
the stimulatory molecule and candidate compound in the absence of the ET2 
and/or GABRE receptor subunit, wherein agonistic or antagonistic activity which 
occurs in the presence of ET2 and/or GABRE but not in its absence indicates that 
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the candidate compound alters GABA A receptor complex function in conjugation 
with the ET2 and/or GABRE receptor subunit 

By "assaying a cellular response" is intended qualitatively or quantitatively 
measuring a cellular response to a candidate compound (e.g. , alteration of binding 
of the ET2 and/or GABRE polypeptide to other members of the GAB A A receptor 
complex, GABA A receptor complex activation). By the invention, a cell 
expressing an ET2 and/or GABRE receptor subunit polypeptide can be contacted 
with either an endogenously or exogenously administered stimulatory molecule. 

By a M GABA A receptor complex ligand" is intended naturally occurring, 
recombinant, and synthetic ligands that are capable of binding to a GABA A 
receptor complex and inducing the ligand/receptor signaling pathway (e.g., 
GAB A A receptor complex activation). Also intended are compounds which bind 
to the GAB A A receptor complex and may be used to assay ET2 and/or GABRE 
receptor subunit activity. GABA A receptor complex ligands include, but are not 
limited to, GAB A, benzodiazepines, steroids (e.g., dehydroepiandrosterone), 
barbiturates, picrotoxin, ethanol, loreclezole, anesthetics, and zinc. 

In an additional aspect, a screening method is provided for determining 
whether a candidate compound is capable of enhancing or inhibiting ET2 and/or 
GABRE receptor subunit activity, wherein the candidate compound and 
stimulatory molecule are the same. This method involves contacting cells which 
express (1) the ET2 and/or GABRE receptor subunit and (2) the GABA A receptor 
complex with a candidate compound, assaying a cellular response, and comparing 
the cellular response to a standard cellular response, the standard being assayed 
when contact is made with the stimulatory molecule in absence of the ET2 and/or 
GABRE receptor subunit, whereby a difference in the cellular response with 
respect to the standard indicates that the candidate compound is either an agonist 
or antagonist of ET2 and/or GABRE receptor subunit activity. 

ET2 and/or GABRE receptor subunit activity can be measured in cells co- 
transfected with genetic material encoding the ET2 and/or GABRE polypeptide 
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and additional GABA A receptor subunits (e.g., a and p subunits) followed by 
measurement of chloride currents activation in response to administration of a 
stimulatory molecule (e.g. , a GAB A A receptor complex ligand). Further, as noted 
above, antagonists and agonists of ET2 and/or GABRE receptor subunit activity 
can be identified by comparing the cellular response in the presence of both a 
stimulatory molecule and a candidate compound, wherein antagonistic or agnostic 
activity towards the cellular response in the presence of the ET2 and/or GABRE 
receptor subunit but not in their absence indicates that the candidate compound 
is either an antagonist or agonist of ET2 and/or GABRE receptor subunit activity. 

Another method for identifying antagonists and agonists of ET2 and/or 
GABRE receptor subunit activity involves screening for compounds which either 
enhance or inhibit binding of labeled GABA A receptor complex ligand molecules 
(e.g. , GAB A) to cells which have the receptor complex on their surfaces. This 
method involves transfecting a cell with genetic material encoding (1) the ET2 
and/or GABRE receptor subunit and (2) additional members of the GABA A 
receptor complex, such that the cell expresses GAB A A receptor complex binding 
sites on its surface, and contacting the cell with a candidate compound in the 
presence of a labeled form of a GAB A A receptor complex ligand, e.g., labeled with 
a radioactive isotope. The amount of labeled ligand bound to the receptor 
complex is measured. If the compound, for example, competitively binds to the 
receptor, as determined by a reduction of labeled ligand which binds to the 
receptors, the binding of labeled ligand to the receptor is inhibited. As above, a 
determination of whether the candidate compound exerts its influence on the 
binding of a ligand to the GAB A A receptor complex via the ET2 and/or GABRE 
receptor subunit will involve the measurement of GAB A A receptor complex ligand 
binding in the presence of the candidate compound and both in the presence and 
absence of the ET2 and/or GABRE receptor subunit. 

Soluble forms of the polypeptides of the present invention may be utilized 
in the ligand binding assay described above. Such forms of the ET2 and/or 
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GABRE receptor subunit are candidate agonists and antagonists of the present 
invention and may alter ligand binding to the GAB A A receptor complex by binding 
to either the labeled GABA A receptor complex ligand (e.g., GAB A) or to the 
GABA A receptor complex itself. 

Antagonist according to the present invention include soluble forms of the 
ET2 and/or GABRE receptor subunits (e.g., fragments of the ET2 or GABRE 
receptor subunit shown in SEQ ID NO:2 and SEQ ID NO:42, respectively). Such 
soluble forms of the receptor, which may be naturally occurring or synthetic and 
may antagonize ET2 and/or GABRE receptor subunit activity by binding to either 
the GABA A receptor complex or stimulatory molecules. Antagonists of the 
present invention also include antibodies specific for the ET2 and/or GABRE 
receptor subunit and ET2 and/or GABRE receptor subunit-Fc fusion proteins 
which contain regions of the ET2 and/or GABRE polypeptide required for binding 
to the GAB A A receptor complex. Techniques for producing antibodies which bind 
to a protein of interest and fusion proteins are well known in the art and are 
described, for example, in Ausbel, F. et al , eds., Current Protocols in Molecular 
Biology, John Wiley & Sons, Inc. (1997) (see, e.g., Chapters 11 and 16), the 
entire disclosure of which is hereby incorporated herein by reference. 

Antibodies of the present invention may be prepared by any of a variety of 
methods using ET2 or GABRE receptor subunit immunogens of the present 
invention. Such ET2 or GABRE receptor subunit immunogens include the ET2 
and GABRE receptor subunit proteins shown in SEQ ID NO:2 and SEQ ID 
NO:42 and polypeptide fragments of the ET2 and GABRE receptor subunits 
comprising the extracellular, intracellular and transmembrane domains, or any 
combination thereof. 

Polyclonal and monoclonal antibody agonist or antagonist of the present 
invention can be raised according to the methods disclosed in Tartaglia and 
Goeddel, J. Biol Chem. 267:4304-4307(1992)); Tartaglia etal, Cell 73:213-216 
(1993)), and PCT Application WO 94/09137. 
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In a preferred method, antibodies according to the present invention are 
mAbs. Such mAbs can be prepared using hybridoma technology (Kohler and 
Millstein, Nature 256:495-497 (1975) and U.S. Patent No. 4,376,1 10; Harlow et 
al, Antibodies: A Laboratory Manual, Cold Spring Harbor Laboratory Press, 
Cold Spring Harbor, NY, 1988; Monoclonal Antibodies andHybridomas: A New 
Dimension in Biological Analyses, Plenum Press, New York, NY, 1980; 
Campbell, "Monoclonal Antibody Technology," In: Laboratory Techniques in 
Biochemistry and Molecular Biology, Volume 1 3 (Burdon et al , eds.), Elsevier, 
Amsterdam (1984)). 

Proteins and other compounds which bind the ET2 and/or GABRE 
receptor subunit domains are also candidate agonist and antagonist according to 
the present invention. Such binding compounds can be "captured" using the yeast 
two-hybrid system (Fields and Song, Nature 340:245-246 (1989)). A modified 
version of the yeast two-hybrid system has been described by Roger Brent and his 
colleagues (Gyuris, J. et al, Cell 75:791-803 (1993); Zervos, A.S. et al, Cell 
72:223-232 (1993)). Preferably, the yeast two-hybrid system is used according 
to the present invention to capture compounds which bind to one or more domains 
of the ET2 and/or GABRE receptor subunits. Such compounds are good 
candidate agonists and antagonists of the present invention. 

Using the two-hybrid assay described above, the domains of the ET2 and 
GABRE receptor subunits, or portions thereof, may be used to identify cellular 
proteins which interact with the receptor subunit in vivo. Such an assay may also 
be used to identify ligands with potential agonistic or antagonistic activity of ET2 
and/or GABRE receptor subunit function. This screening assay has previously 
been used to identify proteins which interact with the cytoplasmic domain of the 
murine TNF-RII and led to the identification of two receptor associated proteins. 
Rothe, M. et al, Cell 78:681 (1994). Such proteins and amino acid sequences 
which bind to the cytoplasmic and transmembrane domains of the ET2 and 
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GABRE receptor subunits are good candidate agonist and antagonist of the 
present invention. 

Further, the effect that a candidate compound has on the binding of the 
ET2 and/or GABRE receptor subunit to the GABA A receptor complex ligands in 
vitro may also be measured to identify compounds which are agonists and 
antagonists of ET2 and/or GABRE receptor subunit activity. Techniques for 
measuring such binding activity are described, for example, in Ausbel, F. et al 9 
eds. 5 Current Protocols in Molecular Biology, John Wiley & Sons, Inc. (1997), 
Chapter 20, and include binding assays using GST fusion proteins. 

ET2 and GABRE receptor subunit agonists and antagonists of the present 
invention may be employed to stimulate GAB A A receptor complex ligand induced 
activities, such as the modulation of GABA A receptor complex activity. Thus, the 
agonists and antagonists of the present invention may be employed to alter the 
cellular response of the GAB A A receptor complex to GABA A receptor complex 
ligands. 

Therapeutics and Modes of Administration 

The ET2 and GABRE receptor subunit are each believed to be involved 
in a number of disease states involving the GABA A receptor complex {e.g., 
schizophrenia, Huntington's Chorea, muscle spasms and rigidity, sleep disorders, 
seizure disorders (e.g., epilepsy), alcoholism, anxiety related disorders, AIDS 
related dementia, Alzheimer's disease, Parkinson's disease, and additional diseases 
associated with neurological function). ET2 and GABRE receptor subunit 
involvement in a disease state may result from either underexpression or 
overexpression of both or one of these receptor subunits, increased or decreased 
sensitivity of the GABA A receptor complex to a ligand, or either enhanced or 
decreased binding of the ET2 and/or GABRE polypeptide to the GAB A A receptor 
complex. Thus, important targets for inhibitors of ET2 and GABRE receptor 
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subunit functions are ligand binding domains and domains necessary for interaction 
with other members of the GAB A A receptor complex. Therefore, included within 
the scope of the invention are molecules capable of inhibiting or stimulating ET2 
and/or GABRE receptor subunit activity. Such molecules include antibodies 
specific for the ET2 and/or GABRE polypeptide and derivatives of the ET2 and/or 
GABRE receptor subunit. Suitable derivatives of the ET2 and/or GABRE 
receptor subunit include all of the domains of these proteins and fragments 
thereof. 

It will be appreciated that conditions caused by a decrease in the standard 
or normal level of ET2 and/or GABRE receptor subunit activity in an individual, 
can be treated by administration of ET2 and/or GABRE protein or an agonist of 
ET2 and/or GABRE receptor subunit activity. Thus, the invention further 
provides a method of treating an individual in need of an increased level of ET2 
and/or GABRE receptor subunit activity comprising administering to such an 
individual a pharmaceutical composition comprising an effective amount of an 
isolated ET2 and/or GABRE polypeptide of the invention, or agonist of ET2 
and/or GABRE receptor subunit activity, effective to increase the ET2 and/or 
GABRE receptor subunit activity level in such an individual. 

It will further be appreciated that conditions caused by an increase in the 
standard or normal level of ET2 and/or GABRE receptor subunit activity in an 
individual, can be treated by administration of an antagonist of ET2 and/or 
GABRE receptor subunit activity. Thus, the invention further provides a method 
of treating an individual in need of a decreased level of ET2 and/or GABRE 
receptor subunit activity comprising administering to such an individual a 
pharmaceutical composition comprising an effective amount of an antagonist of 
ET2 and/or GABRE receptor subunit activity, effective to decrease the ET2 
and/or GABRE receptor subunit activity level in such an individual. 

The composition containing the ET2 and/or GABRE polypeptide, agonist, 
or antagonist will be formulated and dosed in a fashion consistent with good 
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medical practice, taking into account the clinical condition of the individual patient 
(especially the side effects of the treatment), the site of delivery of the 
composition, the method of administration, the scheduling of administration, and 
other factors known to practitioners. The "effective amount' 1 of an ET2 and/or 
GABRE polypeptide, agonist, or antagonist for purposes herein, is thus 
determined by such considerations. 

Further, compositions containing an ET2 and/or GABRE polypeptide, 
agonist, or antagonist may contain mixtures of different molecules formulated to 
either increase or decrease GABA A receptor complex activity. For example, an 
ET2 and/or GABRE polypeptide may be combined with an agonist of ET2 and/or 
GABRE receptor subunit activities to treat an affliction associated with decreased 
ET2 and/or GABRE receptor subunit activity. 

As a general proposition, the total pharmaceutically effective amount of 
ET2 and/or GABRE polypeptide, agonist, or antagonist administered parenterally 
per dose will be in the range of about 1 jag/kg/day to 10 mg/kg/day of patient 
body weight, although, as noted above, this will be subject to therapeutic 
discretion. More preferably, this dose is at least 0.01 mg/kg/day, and most 
preferably for humans between about 0.01 and 1 mg/kg/day for the hormone. If 
given continuously, an ET2 polypeptide, a GABRE polypeptide, an agonist, an 
antagonist, or combination thereof will typically be administered at a dose rate of 
about 1 (ag/kg/hour to about 50 |Lig/kg/hour, either by 1-4 injections per day or by 
continuous subcutaneous infusions, for example, using a mini-pump. An 
intravenous bag solution may also be employed. The length of treatment needed 
to observe changes and the interval following treatment for responses to occur 
appears to vary depending on the desired effect. 

Pharmaceutical compositions containing an ET2 polypeptide, a GABRE 
polypeptide, an agonist, an antagonist, or combination thereof may be 
administered orally, rectally, parenterally, intracistemally, intravaginally, 
intraperitoneally, topically (as by powders, ointments, drops or transdermal patch), 
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bucally, or as an oral or nasal spray. By "pharmaceutically acceptable carrier" is 
meant a non-toxic solid, semisolid or liquid filler, diluent, encapsulating material 
or formulation auxiliary of any type. The term "parenteral" as used herein refers 
to modes of administration which include intravenous, intramuscular, 
intraperitoneal, intrasternal, subcutaneous and intraarticular injection and infusion. 

The ET2 polypeptide, GABRE polypeptide, agonist, antagonist, or 
combination thereof is also suitably administered by sustained-release systems. 
Suitable examples of sustained-release compositions include semi-permeable 
polymer matrices in the form of shaped articles, e.g., films, or microcapsules. 
Sustained-release matrices include polylactides (U.S. Pat. No. 3,773,919, EP 
58,481), copolymers of L-glutamic acid and gamma-ethyl-L-glutamate (Sidman, 
U. et al, Biopolymers 22:547-556 (1983)), poly (2- hydroxyethyl methacrylate) 
(R. Langer et al, J. Biomed. Mater. Res. 75:167-277 (1981), and R. Langer, 
Chem. Tech. 72:98-105 (1982)), ethylene vinyl acetate (R. Langer et al, Id) or 
poly-D- (-)-3-hydroxybutyricacid(EP 133,988). Sustained-release compositions 
also include liposomally entrapped ET2 and/or GABRE polypeptide, agonist, or 
antagonist compositions. Liposomes containing an ET2 polypeptide, a GABRE 
polypeptide, an agonist, an antagonist, or combination thereof are prepared by 
methods known per se: DE 3,218,121; Epstein et al, Proc. Natl Acad. Set 
(USA) 52:3688-3692 (1985); Hwang et al, Proc, Natl Acad. Set (USA) 
77:4030-4034 (1980); EP 52,322; EP 36,676; EP 88,046; EP 143,949; EP 
142,641; Japanese Pat. Appl. 83-118008; U.S. Pat. Nos. 4,485,045 and 
4,544,545; and EP 102,324. Ordinarily, the liposomes are of the small (about 
200-800 Angstroms) unilamellar type in which the lipid content is greater than 
about 30 mol. percent cholesterol, the selected proportion being adjusted for the 
optimal effectiveness of the therapy. 

For parenteral administration, in one embodiment, the composition to be 
administered is formulated generally by mixing it at the desired degree of purity, 
in a unit dosage injectable form (solution, suspension, or emulsion), with a 
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pharmaceutically acceptable carrier, i.e., one that is non-toxic to recipients at the 
dosages and concentrations employed and is compatible with other ingredients of 
the formulation. For example, the formulation preferably does not include 
oxidizing agents and other compounds that are known to be deleterious to 
polypeptides. 

Generally, the formulations are prepared by contacting the ET2 
polypeptide, GABRE polypeptide, agonist, antagonist, or combination thereof 
uniformly and intimately with liquid carriers or finely divided solid carriers or both. 
Then, if necessary, the product is shaped into the desired formulation. Preferably 
the carrier is a parenteral carrier, more preferably a solution that is isotonic with 
the blood of the recipient. Examples of such carrier vehicles include water, saline, 
Ringer's solution, and dextrose solution. Non-aqueous vehicles such as fixed oils 
and ethyl oleate are also useful herein, as well as liposomes. 

The carrier suitably contains minor amounts of additives such as 
substances that enhance isotonicity and chemical stability. Such materials are non- 
toxic to recipients at the dosages and concentrations employed, and include 
buffers such as phosphate, citrate, succinate, acetic acid, and other organic acids 
or their salts; antioxidants such as ascorbic acid; low molecular weight (less than 
about ten residues) polypeptides (e.g., polyarginine or tripeptides; proteins, such 
as serum albumin, gelatin, or immunoglobulins; hydrophilic polymers such as 
polyvinylpyrrolidone; amino acids, such as glycine, glutamic acid, aspartic acid, 
or arginine; monosaccharides, disaccharides, and other carbohydrates including 
cellulose or its derivatives, glucose, mannose, or dextrins; chelating agents such 
as EDTA; sugar alcohols such as mannitol or sorbitol; counterions such as 
sodium; and/or nonionic surfactants such as polysorbates, poloxamers, or PEG). 

The ET2 polypeptide, GABRE polypeptide, agonist, antagonist, or 
combination thereof is typically formulated in such vehicles at a concentration of 
about 0.1 mg/ml to 100 mg/ml, preferably 1-10 mg/ml, at a pH of about 3 to 8. 
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It will be understood that the use of certain of the foregoing excipients, carriers, 
or stabilizers will result in the formation of salts. 

An ET2 polypeptide, a GABRE polypeptide, an agonist, an antagonist, or 
combination thereof to be used for therapeutic administration must be sterile. 
Sterility is readily accomplished by filtration through sterile filtration membranes 
(e.g. , 0.2 micron membranes). Therapeutic compositions generally are placed into 
a container having a sterile access port, for example, an intravenous solution bag 
or vial having a stopper pierceable by a hypodermic injection needle. 

ET2 polypeptides, GABRE polypeptides, agonists, antagonists, or 
combination thereof ordinarily will be stored in unit or multi-dose containers, for 
example, sealed ampules or vials, as an aqueous solution or as a lyophilized 
formulation for reconstitution. As an example of a lyophilized formulation, 1 0-ml 
vials are filled with 5 ml of sterile-filtered aqueous solution containing 1% (w/v) 
of the composition to be administered, and the resulting mixture is lyophilized. 
The infusion solution is prepared by reconstituting the lyophilized composition 
using bacteriostatic Water-for-Injection. 

The invention also provides a pharmaceutical pack or kit comprising one 
or more containers filled with one or more of the ingredients of the pharmaceutical 
compositions of the invention. Associated with such containers) can be a notice 
in the form prescribed by a governmental agency regulating the manufacture, use 
or sale of pharmaceuticals or biological products, which notice reflects approval 
by the agency of manufacture, use or sale for human administration. In addition, 
the polypeptides of the present invention may be employed in conjunction with 
other therapeutic compounds. 

Chromosome Assays 

The nucleic acid molecules of the present invention are also valuable for 
chromosome identification. The sequence is specifically targeted to and can 
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hybridize with a particular location on an individual human chromosome. 
Moreover, there is a current need for identifying particular sites on the 
chromosome. Few chromosome marking reagents based on actual sequence data 
(repeat polymorphisms) are presently available for marking chromosomal location. 
The mapping of DNAs to chromosomes according to the present invention is an 
important first step in correlating those sequences with genes associated with 
disease. 

In certain preferred embodiments in this regard, the cDNA herein disclosed 
is used to clone genomic DNA encoding the ET2 and GABRE receptor subunits. 
This can be accomplished using a variety of well known techniques and libraries, 
which generally are available commercially. The genomic DNA then is used for 
in situ chromosome mapping using well known techniques for this purpose. 
Typically, in accordance with routine procedures for chromosome mapping, some 
trial and error may be necessary to identify a genomic probe that gives a good in 
situ hybridization signal. 

In addition, in some cases, sequences can be mapped to chromosomes by 
preparingPCR primers (preferably 15-25 bp) from the cDNA. Computer analysis 
of the 3' untranslated region of the gene is used to rapidly select primers that do 
not span more than one exon in the genomic DNA, thus complicating the 
amplification process. These primers are then used for PCR screening of somatic 
cell hybrids containing individual human chromosomes. Only those hybrids 
containing the human gene corresponding to the primer will yield an amplified 
portion. 

PCR mapping of somatic cell hybrids is a rapid procedure for assigning a 
particular DNA to a particular chromosome. Using the present invention with the 
same oligonucleotide primers, sublocalization can be achieved with panels of 
portions from specific chromosomes or pools of large genomic clones in an 
analogous manner. Other mapping strategies that can similarly be used to map to 
its chromosome include in situ hybridization, prescreening with labeled flow- 
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sorted chromosomes and preselection by hybridization to construct chromosome 
specific-cDNA libraries. 

Fluorescence in situ hybridization ("FISH") of a cDNA clone to a 
metaphase chromosomal spread can be used to provide a precise chromosomal 
location in one step. This technique can be used with probes from the cDNA as 
short as 50 or 60 bp. For a review of this technique, see Verma et ah, Human 
Chromosomes: A Manual Of Basic Techniques, Pergamon Press, New York 
(1988). 

Once a sequence has been mapped to a precise chromosomal location, the 
physical position of the sequence on the chromosome can be correlated with 
genetic map data. Such data are found, for example, in V. McKusick, Mendelian 
Inheritance In Man, available on-line through Johns Hopkins University, Welch 
Medical Library. The relationship between genes and diseases that have been 
mapped to the same chromosomal region are then identified through linkage 
analysis (coinheritance of physically adjacent genes). 

Next, it is necessary to determine the differences in the cDNA or genomic 
sequence between affected and unaffected individuals. If a mutation is observed 
in some or all of the affected individuals but not in any normal individuals, then the 
mutation is likely to be the causative agent of the disease. 

With current resolution of physical mapping and genetic mapping 
techniques, a cDNA precisely localized to a chromosomal region associated with 
the disease could be one of between 50 and 500 potential causative genes. This 
assumes 1 megabase mapping resolution and one gene per 20 kb. 

Having generally described the invention, the same will be more readily 
understood by reference to the following examples, which are provided by way of 
illustration and are not intended as limiting. 
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Examples 

Example 1; Alternative Transcripts of a Gene Encoding the 
GABA A Receptor e Subunit on Chromosome Xq28 

Introduction 

In mammalian brain, synaptic inhibition of neuronal activity is mediated 
mainly by the action of y-aminobutyric acid (GAB A) at GAB A A receptors. The 
GABA A receptors are oligomeric proteins, composed of homologous subunits, 
that can conduct chloride currents across the synaptic membrane. An extensive 
range of GABA A receptor subunits has been identified through the isolation and 
expression of cDNA clones. Several of these subunits have been shown to confer 
specific properties to the recombinant receptors in which they assemble. The 
spatial variation of subunit expression patterns in the brain is thought to account 
for the pharmacological diversity of this receptor family (reviewed in Macdonald, 
R.L., and Olsen, R.W., Annu. Rev. NeuroscL 1 7:569-602 (1 994); Whiting, P. J., et 
al f Int. Rev. Neurobiol 35:95-138 (1995)). 

Fourteen mammalian GAB A A receptor subunits have been classified within 
five families (al-6, pl-3, yl-3, 6, e) on the basis of amino acid homology 
(Whiting, P.J., et al. s Int. Rev. Neurobiol 35:95-138 (1995); Davies, PA., etal, 
Nature 555:820-823 (1997)). With the exception of the e subunit, all of the 
subunit genes have been assigned to human chromosomal regions. Most of the 
subunit genes are located in three clusters, on chromosomes 4, 5 and 1 5 (Buckle, 
V.J., et at, Neuron 3:647-654 (1989); Wagstaff, J., et al, Am. J. Hum. Genet. 
49-330-331 (1991); Wilcox, A.S., et al, Proc. Natl Acad Set USA 59:5857- 
5861 (1992); Knoll, J.H., etal, Hum. Mol Genet. 2:183-189 (1993); Russek, 
S.J., and Farb, D.H., Genomics 23:528-533 (1994); Hicks, A.A., etal, Genomics 
20:285-288 (1994); McLean, P.J., etal, Genomics 25:580-586 (1995); Greger, 
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V. 9 etal, Genomics 26:258-264 (1995)). Only GABRA3 (on chromosome Xq28) 
and GABRD (chromosome 1) appear to be isolated from other members of the 
gene family (Buckle, V.J., etal, Neuron 5:647-654 (1989); Sommer, B., et al, 
Cell Biol P:561-568 (1990)). 

The e subunit represents a distinct subunit class that was identified recently 
in human brain (Davies, P.A., et al, Nature 355:820-823 (1997)). This subunit 
confers several unique pharmacological and biophysical properties to the GAB A A 
receptors in which it assembles. Most notably, it blocks the ability of anesthetic 
agents to potentiate GABA-gated chloride currents. The e subunit transcript is 
expressed at particularly high levels in the subthalamic nucleus, and is therefore 
a potential target for pharmacological manipulation of neuronal pathways within 
the basal ganglia. 

Owing to the distinctive sequence of the e subunit protein, it was of 
interest to compare the characteristics of the human e subunit gene (GABRE) with 
those of related family members. Here, we describe the structure, location and 
expression patterns of GABRE. Unexpectedly, an alternative transcript of the 
gene was detected during the course of this study. The sequence and expression 
pattern of this mRNA indicate that its cellular role is distinct from that of the e 
subunit transcript. 

Materials and Methods 

Cloning and mapping of GABRE 

Two libraries of human genomic DNA, cloned in ADASH II and AFIX 
(Stratagene), were screened at high stringency (Kirkness, E.F., et al, Genomics 
1 0:985-995 (1 99 1 )) with two 32 P-labeled fragments of the human e subunit cDNA 
(nucleotides 20-328 and 695-1329 of GenBank Accession No. U66661 (SEQ ID 
NO:41)). Sixteen hybridizing clones were obtained from approximately 2 x 10 6 
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plaques. Overlapping inserts were determined by restriction fragment mapping 
and Southern blotting. Exons and flanking introns were sequenced from templates 
of purified A, DNA using the Dye Terminator Cycle Sequencing system (Applied 
Biosy stems). The sequences of three genomic fragments that contain all of the 
exons are shown in SEQ ID NO:3, SEQ ID NO:4, and SEQ ID NO:5. 

Sequencing of an e subunit gene fragment from rat 

The cDNA sequence of the rat e subunit was used to design primers for 
amplification of rat genomic DNA by PCR. Two overlapping fragments of the rat 
e subunit gene were amplified with primer pairs that correspond to nucleotides 1 - 
24 (forward), 2170-2193 (reverse) and 2 167-2 190 (forward), 5936-5959 (reverse) 
of SEQ ID NO:6. Inverse PCR (Ochman, H., et al, in "PCR Protocols", (Innis, 
M.A., et al, eds), Academic Press, San Diego, (1990), pp. 219-227) was used to 
obtain an overlapping fragment from the 3' end of the rat gene. For these 
reactions, the primers corresponded to nucleotides 6851 -6874 (reverse) and 6881- 
6904 (forward) in SEQ ID NO:6. Amplification at 95 °C for 45 s, 60 °C for 1 min, 
72 °C for 2 min was performed for 30 cycles using the XL PCR system (Perkin 
Elmer). Amplified products were purified from agarose gels and sequenced 
directly. 

Cloning of an alternative transcript 

A transcript of GABRE that contains sequences from intron 6 was isolated 
from human testes cDNA using the Marathon cDNA amplification system 
(Clontech). Initially, the 5 - and 3 '-ends of several cDNA clones were obtained by 
anchored PCR, using primers that correspond to nucleotides 1 087- 1110 (reverse), 
215-237 (reverse) or 5667-5692 (forward) of SEQ ID NO:l. The terminal 
sequences of the amplified cDNAs were then used to design primers for 
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amplification of a contiguous cDNA. These correspond to nucleotides 1-24 
(forward) and 6121-6146 (reverse) of SEQ ID NO:l. Amplification at 95°C for 
45 s, 60 °C for 1 min, 72 °C for 2 min was performed for 40 cycles using the XL 
PCR system (Perkin Elmer). Amplified products were purified from an agarose 
gel and sequenced directly. 

Northern blot analysis 

Fragments of GABRE were generated by PCR and labeled with 32 P using 
the NEblot random priming system (New England Biolabs). Probe A was derived 
from exon 9 (nucleotides 8620-9626 of SEQ ID NO:5). Probe B was derived 
from the human e subunit cDNA, and corresponds to exons 4, 5 and 6 of GABRE 
(nucleotides 390-780 of U66661 (SEQIDNO:41)). Probes C and E were derived 
from intron 6 (nucleotides 3959-4934 and 5757-6352 of SEQ ID NO:5, 
respectively). The quality, and relative quantities of mRNA samples was assessed 
using a 32 P-labeled fragment of the human GAPD cDNA (nucleotides 789-1 140, 
Tokunaga, K., et al } Cancer Res. 47:5616-5619 (1987)). All probes were 
hybridized sequentially with the same human RNA blots (Clontech) according to 
the manufacturer's instructions. Blots were washed at 60°C in O.lxSSC, 0.1% 
SDS prior to exposure. Between each hybridization, the blots were stripped of 
probe by boiling in 0.5% SDS. 

Chromosomal localization 

A human/rodent somatic cell hybrid mapping panel (Coriell) was used to 
assign GABRE to the X-chromosome. A fragment of intron 6 was amplified from 
200 ng DNA by PCR. The primers corresponded to nucleotides 6886-6910 
(forward) and 7094-71 1 8 (reverse) of SEQ ID NO:5. Amplification at 95 °C for 
45 s, 60°C for 1 min, 72°C for 2 min was performed for 30 cycles using the 
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GeneAmp system (Perkin Elmer). Localization to chromosome Xq28 was 
achieved by the same method, using DNA from somatic cell hybrids (GM07298, 
GM1 11000 and GM10663; Coriell), and YAC clones (IB0502 and CH0479; 
Rogner, U.C., et al } Hum. Mol Genet 3:2137-2146 (1994)). For additional 
analysis of YAC clones, a fragment of GABRE intron 1 was amplified with 
primers that correspond to nucleotides 162-185 (forward) and 463-486 (reverse) 
of SEQ ID NO:3 . A fragment of GABRE exon 9 was amplified with primers that 
correspond to nucleotides 8139-8162 (forward) and 8250-8273 (reverse) of SEQ 
ID NO:5. Fragments from the 5' and 3 ? regions of GABRA3 were amplified with 
primer pairs that correspond respectively to nucleotides 47-70 (forward), 1 26- 1 49 
(reverse) and 1 423 - 1 446 (forward), 1 5 5 1 - 1 574 (reverse) of the human oc3 subunit 
cDNA(GenBank Accession No. S62908). Amplified products were purified from 
agarose gels and sequenced directly to confirm their identity. 

Sequence analysis 

Nucleotide sequences were analyzed with algorithms of the Wisconsin 
Package (Genetics Computer Group). Human and rat genomic sequences were 
aligned using the default parameters of GAP. Conserved regions of the two 
sequences were highlighted graphically using PLOTSIMIL ARITY with a window 
of 150 nucleotides. Potential secondary structures of the ET2 transcript were 
identified using FOLDRNA. The program was used to identify potential stem 
loops within the first 2 kb of the transcript. The minimum free energy of 
individual stem loops was calculated using the same program. 
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Results and Discussion 

Gene Structure and Chromosomal Location 

GABRE was identified within the overlapping inserts of four phage X 
clones (FIG. 1). The subunit protein is encoded by nine exons that span 22 kb. 
The exon-intron junctions sequences of GABRE are shown in Table 2. Exon and 
intron sequences are written in uppercase and lowercase letters respectively. The 
length of each exon and intron is also indicated. The numbering of exon 
sequences in Table 2 corresponds to a full-length cDNA that encodes the human 
e subunit (GenBank Accession N0.U6666I (SEQ ID NO:41)). 

All of the exons are flanked by canonical acceptor and donor splice sites 
(Breathnach, R., and Chambon, P.,Annu. Rev. Biochem. 50:349-383 (1981); 
Table 2). The locations of these splice sites are conserved precisely with all other 
members of this subunit gene family for which complete gene structures have been 
resolved (Sommer, B., et al, Cell Biol 9:561-568 (1990); Kirkness, E.F., et al, 
Genomics 70:985-995 (1991); Lasham, A., et al, J. Neurochem. 57:352-355 
( 1 99 1 )). The nucleotide sequence of the spliced exons is identical to the sequence 
of a human e subunit cDNA clone (Davies, P.A., et al, Nature 555:820-823 
(1997)), with the exception of a single base substitution within exon 3, 
corresponding to base 344 of the cDNA sequence. This substitution (T for G) 
changes the deduced amino acid sequence by replacing Ala 102 with Ser. This 
residue is not conserved between different classes of GABA A receptor subunits, 
and the functional consequences of this apparent polymorphism are unknown at 
present. 
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GABRE was localized to chromosome Xq28 by amplification of gene 
fragments from somatic cell hybrids (Table 3). Most human GABA A receptor 
subunit genes are located in clusters on chromosomes 4, 5 and 15 (Buckle, V.J., 
etal, Neuron 5:647-654 (1989); Wilcox, A.S., etal, Proc. Natl. Acad. Sci. USA 
SP:5857-5861 (1992); Knoll, J.H., et al, Hum. Mol Genet. 2:183-189 (1993); 
Russek, S.J., and Farb, D.H., Genomics 23:528-533 (1994); Hicks, A.A., etal, 
Genomics 20:285-288 (1994); McLean, P.J., etal, Genomics 2(5:580-586 (1995); 
Greger, V., et al, Genomics 25:258-264 (1995)). However, the Xq28 band 
contains a single representative of the family (GABRA3; Buckle, V.J., et al, 
Neuron 5:647-654 (1 989)). Overlapping YAC clones that contain GABRA3 were 
therefore examined for the presence of GABRE. The detection of GABRE on 
clone IB0502, but not CH0479 (Table 3), demonstrates that GABRE is located 
proximal to, and within 950 kb of GABRA3 (see FIG. 3 of Rogner, U.C., etal, 
Hum. Mol Genet. 3:2137-2146 (1994)). The three known subunit gene clusters 
consist of at least one member from each of the a, p, and y subunit classes. It is 
therefore likely that these gene clusters have duplicated and dispersed as a single 
unit. In this respect, it is of interest to note that the human e subunit is most 
similar to the y subunit class (Davies, P.A., et al, Nature 555:820-823 (1997)). 
It is therefore possible that a P-like subunit is also located within the gene cluster 
on Xq28 (e.g. a homologue of the avian P4 subunit; Bateson, A.N., et al, J. 
Neurochem. 55:1437-40 (1991)). The relative orientations of subunit genes on 
chromosome 15 (Greger, V., etal, Genomics 25:258-264 (1995)) would predict 
that such a p-like subunit would be located distal to GABRA3. 
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Table 3 

Physical linkage of GABRE AND GABRA3 on chromosome Xq28 


Cell line/YAC clone 


X chromosome fragment 


GABRE GABRA3 
(5') (3') (5') (3 1 ) 


GM07298 


Xq 


+ + + + 
+ + + + 

+ + - + 
- - + - 


GM11100 


Xq26-28 


GM 10663 


Xp-Xq27 


IB0502 


950 kb of Xq28 


CH0479 


410 kb of Xq28 



Transcripts derived from GABRE 

Sequencing of the cloned gene revealed that several expressed sequence 
tags are derived from intron 6 of GABRE {e.g. GenBank Accession Nos. C 1 7228 
and R07942). This information led to the isolation of a 6.2 kb cDNA clone from 
human testes, termed ET2. The ET2 cDNA is composed of sequences from 
intron 6 and exons 7-9 of GABRE (see FIG. 2). 

A series of northern hybridizations were performed with 32 P-labeled 
fragments of GABRE to determine if the ET2 and e subunit cDNAs were derived 
from the same primary transcript, or if they represent distinct RNA species. Each 
of the cDNAs contain exon 9 of GABRE, and a fragment of this exon was used 
to probe blots of human mRNA from heart, brain, placenta, lung, liver, skeletal 
muscle, kidney, pancreas, spleen, thymus, prostate, testis, ovary, small intestine, 
and colon (probe A). This probe detected a transcript of 3 .6 kb that is expressed 
abundantly in the heart. The size of this mRNA is similar to that reported for an 
e subunit transcript in the subthalamic nucleus (3.2 kb; Davies, P. A., etal, Nature 
555:820-823 (1997)). Although there have been reports of functional GABA A 
receptors in rat heart (Matsuyama, S., et al. s Am. J. Physiol 264:1051-1061 
(1993); McLemore, G.L., et al, Pharmacology 4P:342-350 (1994)), little is 
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known of the subunit composition or pharmacological characteristics of these 
receptors. Probe A also hybridized with a larger transcript (6.2 kb) that was 
detected in a wide range of tissues. Particularly high levels of the 6.2 kb transcript 
are expressed in placenta, prostate, small intestine and colon. Larger species of 
mRNA (7.9 and 10.8 kb) that hybridize with probe A were also detected in heart, 
placenta and prostate. 

A different pattern of hybridizing mRNAs was detected with probe B 
(exons 4, 5 and 6 of GABRE). In common with the previous results, probe B 
hybridized to transcripts of 3.6, 7.9 and 10.8 kb (heart, placenta and prostate). 
However, a major 6.2 kb transcript was not detected in tissues such as small 
intestine and colon. Although a faint band of 6.2 kb was detected in heart, 
placenta and prostate, its intensity (relative to the 7.9 kb and 10.8 kb transcripts) 
indicate that it represents an mRNA species distinct from the abundant 6.2 kb 
transcript which hybridized to probe A. The absence of a major 6.2 kb transcript 
from all tissues demonstrates that this mRNA species does not contain sequences 
of the three exons that are located upstream of intron 6. 

In order to confirm that the major 6.2 kb transcript corresponds to the ET2 
cDNA, a fragment of intron 6 was also used to probe the blots (probe C). The 
hybridization pattern was identical to that obtained for probe A, except for the 
absence of the 3.6 kb transcript in heart. The same result was obtained using a 
probe from a different region of intron 6 (probe E). 

Together, the hybridization data indicate that only the 3.6 kb transcript is 
capable of encoding a frill-length e subunit protein. All of the larger mRNAs 
contain sequences from intron 6. The 7.9 kb and 10.8 kb mRNAs also contain 
sequences from upstream of intron 6, and are found only in tissues that express a 
3.6 kb species. These large mRNAs may therefore represent partially spliced 
products of the primary e subunit transcript. However, the major 6.2 kb transcript 
does not appear to contain these upstream sequences, and its size is similar to the 
ET2 cDNA, which starts within intron 6. It remains possible that transcription of 
the ET2 mRNA is initiated from upstream of intron 6. However, the hybridization 
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data would require that the processing of such a transcript ignores the splice sites 
of exons 4-6 in favor of a cryptic splice site within intron 6. In view of this 
unusual requirement, and the size of the ET2 cDNA, it was concluded that the 
ET2 transcript is most likely to be initiated from within intron 6 of GABRE. 

Analysis of the ET2 cDNA Sequence 

The three forward reading frames of the ET2 cDNA contain stop codons 
within the first 60 bases. The following 3.8 kb contains only short open reading 
frames (ORFs). These are less than 450 bases in length, or less than 260 bases if 
initiated by a methionine codon. Within the entire cDNA, the longest ORF that 
is initiated by a methionine codon (726 bases) starts at base 3872, and includes 
exons 7-9 of GABRE. This ORF encodes the C-terminal half of the e subunit 
protein (residues 245-506 (residues 1-242 in SEQ ID NO:2)), including the four 
putative transmembrane domains. However, owing to the length of sequence that 
precedes this ORF (3.8 kb), its translation by normal cap-dependent mechanisms 
is considered unlikely. 

All ORFs (> 300 bases) were searched against databases of known DNA 
and protein sequences. The entire cDNA sequence, and its six-frame translations 
were also searched. With the exception of the ORF that encodes the e subunit 
fragment (see above), no other ORFs displayed significant homology to known 
proteins. 

It is possible that the 5' region of the ET2 cDNA encodes a short protein 
of novel sequence. In order to identify potential locations for such an ORF, the 
ET2 cDNA sequence was examined for regions that are highly conserved between 
human and rat. Flanking regions of GABRE were included for comparison (FIG. 
3). This analysis detected eight regions of high similarity (> 80% nucleotide 
identity) between the human and rat genomic sequences. Six of these regions 
correspond to exons of the € subunit. The two other segments are contained 
within intron 6 of GABRE, and represent sequences of the ET2 cDNA (A and B 
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of FIG. 3). Notably, region A is located close to the 5' end of the ET2 cDNA 
(nucleotides 12-108 in SEQ ID NO:l). However, this region contains stop 
codons in all three reading frames. The longest ORF that could be initiated from 
within this region encodes a peptide of only 21 amino acid residues. Region B 
also contains stop codons in all three reading frames. Similarly, these stop codons 
flank only short ORFs (< 120 bases) that have no significant homology to known 
peptides. 

Clearly, the protein-coding potential of the conserved regions is 
questionable at present. However, an analysis of potential mRNA folding patterns 
indicates that regions A and B (FIG. 3; nucleotides 12-96 and 1065-1 146 in SEQ 
ID NO: 1) may perform a distinct functional role. Both regions are predicted to 
adopt stable stem-loop structures. Indeed, within the 5' region of the ET2 cDNA 
(nucleotides 1-2000 in SEQ ID NO:l), these two folds exhibit the greatest 
stability of all predicted stem-loops from the region (each -36 kcal/mol). Such a 
strong secondary structure within region A of the transcript would be predicted 
to obstruct a cap-dependent scanning mechanism of translation initiation (Kozak, 
M., MoL Cell Biol P:5134-5142 (1989); Vega Laso, M.R., etal, J. Biol Chem. 
268:6453-6462 (1993)). However, aprocess of internal ribosome entry, that can 
override such a block of ribosome scanning, has been described for several 
eukaryotic transcripts (Oh, S.K. and Sarnow, P., Curr. Opin. Genet. Dev. 5:295- 
300 (1993); Iizuka, N., et al, Curr. Top. Microbiol. Immunol 203:155-177 
(1995)). If the ET2 mRNA is capable of directing internal ribosome entry, it 
remains possible that a truncated e subunit could be translated from the transcript. 
Future studies will explore this possibility by monitoring the expression of a 
reporter gene after fusion within the ET2 cDNA. In addition, antisera that 
recognize the C-terminal of the e subunit protein will be used to probe cell 
extracts after transfection of the full-length ET2 cDNA. 



-82- 

Example 2: Expression and Purification ofET2 and GABRE in 
E. coli 

The bacterial expression vector pQE9 (pDIO) is used for bacterial 
expression in this example. (QIAGEN, Inc. , 9259 Eton Avenue, Chatsworth 5 C A, 
91311). pQE9 encodes ampicillin antibiotic resistance ("Amp r ") and contains a 
bacterial origin of replication ("ori"), an IPTG inducible promoter, a ribosome 
binding site ("RBS"), six codons encoding histidine residues that allow affinity 
purification using nickel-nitrilo-tri-acetic acid ("Ni-NTA") affinity resin sold by 
QIAGEN, Inc., supra, and suitable single restriction enzyme cleavage sites. These 
elements are arranged such that an inserted DNA fragment encoding a polypeptide 
expresses that polypeptide with the six His residues (i.e., a "6 X His tag")) 
covalently linked to the amino terminus of that polypeptide. 

The DNA sequence encoding the full-length ET2 protein shown in SEQ 
ID NO:2 is amplified essentially as described in Example 1 using PCR 
oligonucleotide primers which anneal to the nucleotide sequences encoding the 
desired portion of the ET2 protein. The DNA sequence encoding the mature 
GABRE protein, lacking the hydrophobic leader sequence, is amplified from the 
deposited cDNA clone using PCR oligonucleotide primers which anneal to the 
amino terminal sequences encoding the desired portion of the GABRE protein and 
to sequences in the deposited construct 3' to the cDNA coding sequence. In each 
instance, additional nucleotides containing restriction sites to facilitate cloning in 
the pQE9 vector are added to the 5' and 3' primer sequences, respectively. One 
of ordinary skill in the art would appreciate, of course, that the point in the protein 
coding sequence where the 5' and 3' primers begin may be varied to amplify a 
DNA segment encoding any desired portion of the complete ET2 or GABRE 
protein. 

The amplified ET2 or GABRE DNA fragment and the vector pQE9 are 
digested with suitable restriction enzymes and the digested DNAs are then ligated 
together. Insertion of the ET2 or GABRE DNA into the restricted pQE9 vector 
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places the protein coding region downstream from the IPTG-inducible promoter 
and in-frame with an initiating AUG and the six histidine codons. 

The ligation mixture is transformed into competent E. coli cells using 
standard procedures such as those described in Sambrook et al, Molecular 
Cloning: a Laboratory Manual 2nd Ed.; Cold Spring Harbor Laboratory Press, 
Cold Spring Harbor, NY (1989). E. coli strain M15/rep4, containing multiple 
copies of the plasmid pREP4, which expresses the lac repressor and confers 
kanamycin resistance ("Kan r "), is used in carrying out the illustrative example 
described herein. This strain, which is only one of many that are suitable for 
expressing either the ET2 or GABRE protein, is available commercially from 
QIAGEN, Inc., supra. Transformants are identified by their ability to grow on LB 
plates in the presence of ampicillin and kanamycin. Plasmid DNA is isolated from 
resistant colonies and the identity of the cloned DNA confirmed by restriction 
analysis, PCR and DNA sequencing. 

Clones containing the desired constructs are grown overnight ( m O/N m ) in 
liquid culture in LB media supplemented with both ampicillin (100 (ig/ml) and 
kanamycin (25 |^g/ml). The O/N culture is used to inoculate a large culture, at a 
dilution of approximately 1 :25 to 1:250. The cells are grown to an optical density 
at 600 nm ("OD600") of between 0.4 and 0.6. Isopropyl-b-D- 
thiogalactopyranoside ("IPTG") is then added to a final concentration of 1 mM to 
induce transcription from the lac repressor sensitive promoter, by inactivating the 
lad repressor. Cells subsequently are incubated further for 3 to 4 hours. Cells 
then are harvested by centrifugation. 

The cells are then stirred for 3-4 hours at 4°C in 6M guanidine-HCl, pH8. 
The cell debris is removed by centrifugation, and the supernatant containing the 
ET2 or GABRE is loaded onto a nickel-nitrilo-tri-acetic acid ("NiNTA") affinity 
resin column (available from QIAGEN, Inc., supra). Proteins with a 6 x His tag 
bind to the NI-NTA resin with high affinity and can be purified in a simple one- 
step procedure (for details see: The QIAexpressionist, 1995, QIAGEN, Inc., 
supra). Briefly the supernatant is loaded onto the column in 6 M guanidine-HCl, 
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pH8, the column is first washed with 1 0 volumes of 6 M guanidine-HCl, pH8, then 
washed with 10 volumes of 6 M guanidine-HCl pH6, and finally the ET2 or 
GABRE is eluted with 6 M guanidine-HCl, pH5. 

The purified protein is then renatured by dialyzing it against phosphate- 
buffered saline (PBS) or 50 mM Na-acetate, pH 6 buffer plus 200 mM NaCl. 
Alternatively, the protein can be successfully refolded while immobilized on the 
Ni-NTA column. The recommended conditions are as follows: renature using a 
linear 6M-1M urea gradient in 500 mM NaCl, 20% glycerol, 20 mM Tris/HCl 
pH7.4, containing protease inhibitors. The renaturation should be performed over 
a period of 1 .5 hours or more. After renaturation the proteins can be eluted by the 
addition of 250 mM imidazole. Imidazole is removed by a final dialyzing step 
against PBS or 50 mM sodium acetate pH6 buffer plus 200 mM NaCl. The 
purified protein is stored at 4°C or frozen at -80°C. 

Example 3: Cloning and Expression ofET2 and GABRE Protein 
in a Baculovirus Expression System 

In this illustrative example, the plasmid shuttle vector pA2 GP is used to 
insert the cloned DNA encoding the ET2 protein into a baculovirus to express the 
full-length ET2, using a baculovirus leader. Further, the plasmid shuttle vector 
pA2 is used to insert the cloned DNA encoding the complete protein, including 
its naturally associated secretary signal (leader) sequence, into a baculovirus to 
express the mature GABRE protein. In each case, standard methods described in 
Summers et ah, A Manual of Methods for Baculovirus Vectors and Insect Cell 
Culture Procedures, Texas Agricultural Experimental Station Bulletin No. 1 555 
(1 987) are employed. 

The pA2 GP and pA2 expression vectors contain the strong polyhedrin 
promoter of the Autographa californica nuclear polyhedrosis virus (AcMNPV) 
followed by the secretory signal peptide (leader) of the baculovirus gp67 protein 
and convenient restriction sites such as BamHl, Xbal and Aspl\%. The 
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polyadenylation site of the simian virus 40 ("SV40") is used for efficient 
polyadenylation. For easy selection of recombinant virus, the plasmid contains the 
beta-galactosidase gene from E. coli under control of a weak Drosophila promoter 
in the same orientation, followed by the polyadenylation signal of the polyhedrin 
gene. The inserted genes are flanked on both sides by viral sequences for cell- 
mediated homologous recombination with wild-type viral DNA to generate viable 
virus that expresses the cloned polynucleotide. 

Many other baculovirus vectors could be used in place of the vector above, 
such as pAc373, pVL941 and pAcIMl, as one skilled in the art would readily 
appreciate, as long as the construct provides appropriately located signals for 
transcription, translation, secretion and the like, including a signal peptide and an 
in-frame AUG as required. Such vectors are described, for instance, in Luckow 
et aU Virology 170:31 -39. 

The cDNA sequence encoding the full-length ET2 protein shown in SEQ 
ID NO:2, is amplified essentially as described in Example 1 using PCR 
oligonucleotide primers corresponding, for example, to the 5' and 3' sequences of 
the ET2 coding sequences. The DNA sequence encoding the full-length GABRE 
protein, including the hydrophobic leader sequence, is amplified from the 
deposited cDNA clone using PCR oligonucleotide primers which anneal to the 
amino terminal sequences encoding the GABRE protein and to sequences in the 
deposited construct 3' to the cDNA coding sequence. In each instance, additional 
nucleotides containing restriction sites to facilitate cloning in the pQE9 vector are 
added to the 5' and 3 r primer sequences, respectively. One of ordinary skill in the 
art would appreciate, of course, that the point in the protein coding sequence 
where the 5' and 3' primers begin may be varied to amplify a DNA segment 
encoding any desired portion of the complete ET2 or GABRE protein. 

The amplified fragment is isolated from a 1% agarose gel using a 
commercially available kit ("Geneclean," BIO 101 Inc., La Jolla, Ca.). The 
fragment then is digested with suitable restriction enzymes and again is purified 
on a 1% agarose gel. This fragment is designated herein "Fl". 
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The plasmid is also digested with the restriction enzymes suitable for the 
generation of "sticky" ends which will anneal with those of the amplified ET2 or 
GABRE coding sequence. Optionally, the plasmid can be dephosphorylated using 
calf intestinal phosphatase, using routine procedures known in the art. The DNA 
is then isolated from a 1% agarose gel using a commercially available kit 
("Geneclean" BIO 1 0 1 Inc., La Jolla, Ca.). This vector DNA is designated herein 
"VI". 

Fragment Fl and the dephosphorylated plasmid VI are ligated together 
with T4 DNA ligase. E. coli HB101 or other suitable E. coli hosts such as XL-1 
Blue (Stratagene Cloning Systems, La Jolla, CA) cells are transformed with the 
ligation mixture and spread on culture plates. Bacteria are identified that contain 
the plasmid with the human ET2 or GABRE gene using the PCR method, in which 
one of the primers that is used to amplify the gene and the second primer is from 
well within the vector so that only those bacterial colonies containing the ET2 or 
GABRE gene fragment will show amplification of the DNA. The sequence of the 
cloned fragment is confirmed by DNA sequencing. These plasmids are designated 
herein pBacET2 and pBacGABRE. 

Five |iig of the plasmid pBacET2 or pBacGABRE is co-transfected with 
1 .0 \xg of a commercially available linearized baculovirus DNA ("BaculoGold™ 
baculovirus DNA", Pharmingen, San Diego, CA.), using the lipofection method 
described by Feigner et al, Proa Natl. Acad Sci. USA 84:1413-1411 (1987). 
One jig of BaculoGold™ virus DNA and 5 \xg of the plasmid pBacET2 or 
pBacGABRE are mixed in a sterile well of a microtiter plate containing 50 \x\ of 
serum-free Grace's medium (Life Technologies Inc., Rockville, MD). Afterwards, 
10 \x\ Lipofectin plus 90 \i\ Grace's medium are added, mixed and incubated for 
15 minutes at room temperature. Then the transfection mixture is added drop- 
wise to Sf9 insect cells (ATCC CRL 1711) seeded in a 35 mm tissue culture plate 
with 1 ml Grace f s medium without serum. The plate is rocked back and forth to 
mix the newly added solution. The plate is then incubated for 5 hours at 27 °C. 
After 5 hours the transfection solution is removed from the plate and 1 ml of 
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Grace's insect medium supplemented with 10% fetal calf serum is added. The 
plate is put back into an incubator and cultivation is continued at 27 °C for four 
days. 

After four days the supernatant is collected and a plaque assay is 
performed, as described by Summers and Smith, supra. An agarose gel with 
"Blue Gal" (Life Technologies Inc., Rockville, MD) is used to allow easy 
identification and isolation of gal-expressing clones, which produce blue-stained 
plaques. (A detailed description of a "plaque assay" of this type can also be found 
in the user f s guide for insect cell culture and baculovirology distributed by Life 
Technologies Inc., Rockville, MD, page 9-10). After appropriate incubation, blue 
stained plaques are picked with the tip of a micropipettor (e.g. , Eppendorf). The 
agar containing the recombinant viruses is then resuspended in a microcentrifuge 
tube containing 200 jal of Grace's medium and the suspension containing the 
recombinant baculovirus is used to infect Sf9 cells seeded in 35 mm dishes. Four 
days later the supernatants of these culture dishes are harvested and then they are 
stored at 4°C. The recombinant viruses are called V-ET2 and V-GABRE. 

To verify the expression of the ET2 or GABRE gene, Sf9 cells are grown 
in Grace's medium supplemented with 10% heat inactivated FBS. The cells are 
infected with the recombinant baculovirus V-ET2 or V-GABRE at a multiplicity 
of infection ("MOI") of about 2. Six hours later the medium is removed and is 
replaced with SF900 II medium minus methionine and cysteine (available from 
Life Technologies Inc., Rockville, MD). If radiolabeled proteins are desired, 42 
hours later, 5 jaCi of 35 S-methionine and 5 \xd 35 S-cysteine (available from 
Amersham) are added. The cells are further incubated for 1 6 hours and then they 
are harvested by centrifugation. The proteins in the supernatant as well as the 
intracellular proteins are analyzed by SDS-PAGE followed by autoradiography (if 
radiolabeled). Microsequencing of the amino acid sequence of the amino terminus 
of purified protein may be used to determine the amino terminal sequence of the 
full-length protein and thus the cleavage point and length of the secretory signal 
peptide. 
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Example 4: Cloning and Expression of ET2 and GABRE in 
Mammalian Cells 

A typical mammalian expression vector contains the promoter element, 
which mediates the initiation of transcription of mRNA ? the protein coding 
sequence, and signals required for the termination of transcription and 
polyadenylation of the transcript. Additional elements include enhancers, Kozak 
sequences and intervening sequences flanked by donor and acceptor sites for RNA 
splicing. Highly efficient transcription can be achieved with the early and late 
promoters from SV40, the long terminal repeats (LTRS) from Retroviruses, e.g. , 
RSV, HTLVI, fflVI and the early promoter of the cytomegalovirus (CMV). 
However, cellular elements can also be used {e.g., the human actin promoter). 
Suitable expression vectors for use in practicing the present invention include, for 
example, vectors such as PSVL and PMSG (Pharmacia, Uppsala, Sweden), 
pRSVcat (ATCC 37152), pSV2dhfr (ATCC 37146) and pBC12MI (ATCC 
671 09). Mammalian host cells that could be used include, human HeLa 293, H9 
and Jurkat cells, mouse NIH3T3 and C127 cells, Cos 1, Cos 7 and CV 1, quail 
QC1-3 cells, mouse L cells and Chinese hamster ovary (CHO) cells. 

Alternatively, the gene can be expressed in stable cell lines that contain the 
gene integrated into a chromosome. The co-transfection with a selectable marker 
such as dhfr, gpt, neomycin, or hygromycin allows the identification and isolation 
of the transfected cells. 

The transfected gene can also be amplified to express large amounts of the 
encoded protein. The DHFR (dihydrofolate reductase) marker is useful to 
develop cell lines that carry several hundred or even several thousand copies of the 
gene of interest. Another useful selection marker is the enzyme glutamine 
synthase (GS) (Murphy etal, Biochem. J. 227:211-219 (1 991); Bebbington etal, 
Bio/Technology 70.169-175 (1992)). Using these markers, the mammalian cells 
are grown in selective medium and the cells with the highest resistance are 
selected. These cell lines contain the amplified gene(s) integrated into a 
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chromosome. Chinese hamster ovary (CHO) and NSO cells are often used for the 
production of proteins. 

The expression vectors pCl and pC4 contain the strong promoter (LTR) 
of the Rous Sarcoma Virus (Cullen et al, Molec. Cell Biol 5:438-447 (1985)) 
plus a fragment of the CMV-enhancer (Boshart et al, Cell 47:521-530 (1985)). 
Multiple cloning sites, e.g., with the restriction enzyme cleavage sites BamUl, 
Xbal and Asp7l8, facilitate the cloning of the gene of interest. The vectors 
contain in addition the 3 f intron, the polyadenylation and termination signal of the 
rat preproinsulin gene. 

Example 4(a): Cloning and Expression in COS Cells 

The expression plasmids, pET2-HA and pGABRE-HA, are made by 
cloning a cDNA encoding ET2 or GABRE into the expression vector 
pcDNAI/Amp or pcDNAIII (which can be obtained from Invitrogen, Inc.). 

The expression vector pcDNAI/amp contains: (1) an E. coli origin of 
replication effective for propagation in E. coli and other prokaryotic cells; (2) an 
ampicillin resistance gene for selection of plasmid-containing prokaryotic cells; (3) 
an SV40 origin of replication for propagation in eukaryotic cells; (4) a CMV 
promoter, a polylinker, an SV40 intron; (5) several codons encoding a 
hemagglutinin fragment {i.e., an "HA" tag to facilitate purification) followed by 
a termination codon and polyadenylation signal arranged so that a cDNA can be 
conveniently placed under expression control of the CMV promoter and operably 
linked to the SV40 intron and the polyadenylation signal by means of restriction 
sites in the polylinker. The HA tag corresponds to an epitope derived from the 
influenza hemagglutinin protein described by Wilson et al, Cell 37:767-778 
(1984). The fusion of the HA tag to the target protein allows easy detection and 
recovery of the recombinant protein with an antibody that recognizes the HA 
epitope. The pcDNAIII vector contains, in addition, the selectable neomycin 
marker. 
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A DNA fragment encoding the ET2 or GABRE is cloned into the 
polylinker region of the vector so that recombinant protein expression is directed 
by the CMV promoter. The plasmid construction strategy is as follows. ET2 
cDNA is amplified essentially as described in Example 1 using 5' and 3 ! primers 
which contain convenient restriction sites. The DNA sequence encoding the 
full-length GABRE protein, including the hydrophobic leader sequence, is 
amplified from the deposited cDNA clone using PCR oligonucleotide primers 
which anneal to the amino terminal sequences encoding the GABRE protein and 
to sequences in the deposited construct 3' to the cDNA coding sequence. 

The PCR amplified DNA fragment and the vector, pcDNAI/Amp, are 
digested with suitable restriction enzymes and then ligated. The ligation mixture 
is transformed into E. coli strain SURE (available from Stratagene Cloning 
Systems, 11099 North Torrey Pines Road, La Jolla, CA 92037), and the - 
transformed culture is plated on ampicillin media plates which then are incubated 
to allow growth of ampicillin resistant colonies. Plasmid DNA is isolated from 
resistant colonies and examined by restriction analysis or other means for the 
presence of the ET2 or GABRE-encoding fragment. 

For expression of recombinant ET2 or GABRE, COS cells are transfected 
with an expression vector, as described above, using DEAE-DEXTRAN, as 
described, for instance, in Sambrook et al, Molecular Cloning: a Laboratory 
Manual Cold Spring Laboratory Press, Cold Spring Harbor, New York (1989). 
Cells are incubated under conditions for expression of ET2 or GABRE by the 
vector. 

Expression of the ET2-HA and GABRE-HA fusion protein is detected by 
radiolabeling and immunoprecipitation, using methods described in, for example 
Harlow et al, Antibodies: A Laboratory Manual 2nd Ed.; Cold Spring Harbor 
Laboratory Press, Cold Spring Harbor, New York (1988). To this end, two days 
after transfection, the cells are labeled by incubation in media containing 35 S- 
cysteine for 8 hours. The cells and the media are collected, and the cells are 
washed and lysed with detergent-containing RIPA buffer: 150mMNaCl, 1%NP- 
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40, 0.1% SDS, 0.5% DOC, 50 mM TRIS, pH 7.5, as described by Wilson etal 
cited above. Proteins are precipitated from the cell lysate and from the culture 
media using an HA-specific monoclonal antibody. The precipitated proteins then 
are analyzed by SDS-PAGE and autoradiography. An expression product of the 
expected size is seen in the cell lysate, which is not seen in negative controls. 

Example 4(b): Cloning and Expression in CHO Cells 

The vector pC4 is used for the expression of ET2 and GABRE proteins. 
Plasmid pC4 is a derivative of the plasmid pSV2-dhfr (ATCC Accession No. 
37146). The plasmid contains the mouse DHFR gene under control of the SV40 
early promoter. Chinese hamster ovary- or other cells lacking dihydrofolate 
reductase activity that are transfected with these plasmids can be selected by 
growing the cells in a selective medium (alpha minus MEM, Life Technologies) 
supplemented with the chemotherapeutic agent methotrexate. The amplification 
of the DHFR genes in cells resistant to methotrexate (MTX) has been well 
documented (see, e.g., Alt, F. W., Kellems, R. M., Bertino, J. R., and Schimke, 
R. T., 1978, J Biol Chem. 253:1357-1370, Hamlin, J. L. and Ma, C. 1990, 
Biochem. et Biophys. Acta, 7097:107-143, Page, M. J. and Sydenham, M.A. 
1 99 1 , Biotechnology P:64-68). Cells grown in increasing concentrations of MTX 
develop resistance to the drug by overproducing the target enzyme, DHFR, as a 
result of amplification of the DHFR gene. If a second gene is linked to the DHFR 
gene, it is usually co-amplified and over-expressed. It is known in the art that this 
approach may be used to develop cell lines carrying more than 1 ,000 copies of the 
amplified gene(s). Subsequently, when the methotrexate is withdrawn, cell lines 
are obtained which contain the amplified gene integrated into one or more 
chromosome(s) of the host cell. 

Plasmid pC4 contains for expressing the gene of interest the strong 
promoter of the long terminal repeat (LTR) of the Rous Sarcoma Virus (Cullen 
et al, Molec. Cell Biol 5:438-447 (1985)) plus a fragment isolated from the 
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enhancer of the immediate early gene of human cytomegalovirus (CMV) (Boshart 
et al, Cell 41:521 -530 (1 985)). Downstream of the promoter are BamHL, Xbal, 
and Aspl 18 restriction enzyme cleavage sites that allow integration of the genes. 
Behind these cloning sites the plasmid contains the 3 ? intron and polyadenylation 
site of the rat preproinsulin gene. Other high efficiency promoters can also be 
used for the expression, e.g., the human p-actin promoter, the SV40 early or late 
promoters or the long terminal repeats from other retroviruses, e.g., HIV and 
HTLVI. Clontech's Tet-Off and Tet-On gene expression systems and similar 
systems can be used to express the ET2 or GABRE in a regulated way in 
mammalian cells (Gossen, M., & Bujard, H. 1 992, Proc. Natl. Acad Set USA 89: 
5547-5551). For the polyadenylation of the mRNA other signals, e.g., from the 
human growth hormone or globin genes can be used as well. Stable cell lines 
carrying a gene of interest integrated into the chromosomes can also be selected 
upon co-transfection with a selectable marker such as gpt, G41 8 or hygromycin. 
It is advantageous to use more than one selectable marker in the beginning, e.g., 
G418 plus methotrexate. 

The plasmid pC4 is digested with the restriction enzymes suitable 
restriction enzymes and then dephosphorylated using calf intestinal phosphatase 
by procedures known in the art. The vector is then isolated from a 1% agarose 
gel. 

The DNA sequence encoding the complete ET2 protein is amplified 
essentially as described in Example 1 using PCR oligonucleotide primers 
corresponding to the 5' and 3 ! sequences of the gene sequences which are to be 
cloned. The DNA sequence encoding the full-length GABRE protein, including 
the hydrophobic leader sequence, is amplified from the deposited cDNA clone 
using PCR oligonucleotide primers which anneal to the amino terminal sequences 
encoding the GABRE protein and to sequences in the deposited construct 3 * to the 
cDNA coding sequence. In each instance, the amplified fragment is digested with 
the endonucleases restriction enzymes which produce suitable "sticky" ends for 
ligation into the pC4 plasmid and then purified again on a 1% agarose gel. The 
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isolated fragment and the dephosphorylated vector are then ligated with T4 DNA 
ligase. E. coli HB101 or XL-1 Blue cells are then transformed and bacteria are 
identified that contain the fragment inserted into plasmid pC4 using, for instance, 
restriction enzyme analysis. 

Chinese hamster ovary cells lacking an active DHFR gene are used for 
transfection. Five jag of the expression plasmid pC4 is cotransfected with 0.5 jag 
of the plasmid pSV2-neo using lipofectin (Feigner et al, supra). The plasmid 
pS V2neo contains a dominant selectable marker, the neo gene from Tn5 encoding 
an enzyme that confers resistance to a group of antibiotics including G418. The 
cells are seeded in alpha minus MEM supplemented with 1 mg/ml G41 8. After 2 
days, the cells are trypsinized and seeded in hybridoma cloning plates (Greiner, 
Germany) in alpha minus MEM supplemented with 10, 25, or 50 ng/ml of 
metothrexate plus 1 mg/ml G418. After about 10-14 days single clones are 
trypsinized and then seeded in 6-well petri dishes or 10 ml flasks using different 
concentrations of methotrexate (50 nM, 100 nM, 200 nM, 400 nM, 800 nM). 
Clones growing at the highest concentrations of methotrexate are then transferred 
to new 6-well plates containing even higher concentrations of methotrexate 
(1 (iM, 2 |iM, 5 jiM, 10 |iM, 20 |nM). The same procedure is repeated until 
clones are obtained which grow at a concentration of 100-200 jaM. Expression 
of the desired gene product is analyzed, for instance, by SDS-PAGE and Western 
blot or by reverse phase HPLC analysis. 

Example 5: Tissue Distribution of ET2 and GABRE mRNA 
Expression 

Northern blot analysis is carried out to examine ET2 and GABRE gene 
expression in human tissues, using methods described by, among others, 
Sambrook et al, cited above. A cDNA probe containing the entire nucleotide 
sequence of the ET2 protein (SEQ ID NO:l) or the GABRE protein (SEQ ID 
NO:41) is labeled with 32 P using the re^/prime™ DNA labeling system 
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(Amersham Life Science), according to manufacturer's instructions. After 
labeling, the probe is purified using a CHROMA SPIN- 1 00™ column (Clontech 
Laboratories, Inc.), according to manufacturer's protocol number PT1200-1 . The 
purified labeled probe is then used to examine various human tissues for ET2 or 
GABRE mRNA. 

Multiple Tissue Northern (MTN) blots containing various human tissues 
(H) or human immune system tissues (IM) are obtained from Clontech and are 
examined with the labeled probe using ExpressHyb™ hybridization solution 
(Clontech) according to manufacturer's protocol number PT1 190-1 . Following 
hybridization and washing, the blots are mounted and exposed to film at -70 °C 
overnight, and films developed according to standard procedures. 

Example 6: Insensitivity to Anaesthetic Agents Conferred by a 
Class of GABA A Receptor Subunit 

At least thirteen different subtypes of GAB A A receptor subunits have been 
identified inmammals and categorized into four structural classes (al -6, 01-3, y 1 - 
3 and 6). These subunits are thought to assemble in different pentameric 
complexes, with most functional receptors containing ce/p/y or a/0/5 subunit 
combinations (Whiting, P. et al, Int. Rev. Neurobiol. 38:95- 1 3 8 ( 1 995)). 

An expressed sequence tag (EST) which encodes a new class of subunit 
was identified by searching an EST database with a peptide consensus sequence 
of known family members. Using the sequence of this EST, flanking cDNA 
sequences were amplified from a hippocampal cDNA library by anchored 
polymerase chain reaction (PCR). The longest contiguous cDNA identified by this 
method is 3. 1 kilobases in length and contains a large open reading frame which 
encodes a polypeptide of 506 amino-acid residues (FIG. 4 (SEQ ID NO:41)). 
This amino-acid sequence (SEQ ID NO:42) is most closely related to GABA A 
receptor y-subunits (38-43% amino-acid identity) and a-subunits (28-30%) and 
has less similarity to other GABA and glycine receptor subunits (<25%). The 
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maximum levels of sequence identity are similar to those found between the 
different classes of GABA A receptor subunits (Whiting, P. et ah, Int. Rev. 
Neurobiol 38:95- 1 3 8 (1 995)), and justified its categorization in a distinct subunit 
class (e). 

Northern analyses were used to examine the expression of e-subunit 
mRNA in various tissues. Two species of RNA (2.8 and 3.2 kb) were readily 
detectable in a number of brain regions using probes derived from either the 3'- or 
5 f -ends of the e-subunit cDNA. While these transcripts were barely detectable in 
samples of whole brain, they were relatively enriched in amygdala and thalamus, 
and were particularly abundant in the subthalamic nucleus. The subthalamic 
nucleus is known to receive major GABA-mediated inputs, with particularly dense 
projections arising from the globus pallidus (Bell, K. et ah, Synapse 20:10-18 
(1995)). Notably, the relatively low levels of y- and S-subunit transcripts in rat 
subthalamic nucleus (Wisden, W. et al, J. Neuroscl 72:1040-1062 (1992)) are 
consistent with expression of alternative receptor subunits in this tissue. 

The biophysical and pharmacological properties conferred to GABA A 
receptors by the e-subunit were examined after transient expression in HEK-293 
cells. Cells transfected with only the e-subunit cDNA did not express binding sites 
of the GABA A receptor ligands [ 3 H]muscimol or [ 35 S] /-butyl 
bicyclophosphorothionate (TBPS). These cells also failed to exhibit chloride 
currents in response to GAB A or glycine (each 100 ^M). The e-subunit is 
therefore unlike the p class of receptor subunits (Whiting, P. et aL, Int. Rev. 
Neurobiol. 35:95-138 (1995)), or the a class of glycine-receptor subunits 
Schmieden, V. et aL, EMBO J. 11 :2025-2032 (1 992)), which can each assemble 
homomeric chloride channels that are gated by these ligands. When cells were co- 
transfected with cDNAs encoding the e-subunit, and either an a-subunit (al, a2 
or a6) or a P-subunit (pi or p3), there was also a failure to detect expression of 
any ligand-binding activities. Therefore the e-subunit does not behave like a 
typical GABA A receptor a- or p-subunit (Whiting, P. et al, Int. Rev. Neurobiol. 
55:95-138 (1995)). As expected, transfection of cells with a combination of a-, 
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P- and e-subunits resulted in the expression of GABA-activated currents (FIG. 
5 A-5C) and [ 35 S]TBPS binding sites (FIG. 6A-6D). However, the transfected cell 
membranes did not bind either [ 3 H]flunitrazepam or [ 3 H]flumazenil. Therefore the 
e-subunit is also unlike the y-class of subunits, which confers a sensitivity to 
benzodiazepines when expressed with a- and p-subunits (Whiting, P. et al, Int. 
Rev. Neurobiol. 38: 95- 1 3 8 ( 1 995)). The remaining subunit class, 8, has no unique 
pharmacological features to compare, but can be distinguished from the e-subunit 
by its sensitivity to barbiturates (Saxena, N. C. and Macdonald, R. L., J. Neurosci. 
14: 7077-7086 (1994) and see below). 

There was little difference between the concentration-response 
relationships for GABA-activated chloride currents recorded from cells transfected 
with a2/pl or a2/pl/e cDNAs (FIG. 5A). The selective GABA A receptor 
inhibitors picrotoxin (10 uM) and bicuculline ( 1 0 uM) blocked currents mediated 
by the oc2/pi/e subunit combination by 82.5 ± 7.4% (N= 3) and 73.6 ± 9.3% (N= 
4), respectively. However, a comparison of current- voltage relationships showed 
that e-subunits can assemble with a- and p-subunits and confer new biophysical 
properties. Previous studies have shown that native and recombinant GABA A 
receptors exhibit apronounced outward rectification (Verdoorn, T. etal, Neuron. 
4:919-928 (1992); Davies, P. etal.,Br. J. Pharmacol. 120:899-909 (1997); Gray, 
R. and Johnston, D., J. Neurophysiol. 54:134-142 (1985); and Segal, M. et al, 
J. Neurophysiol. 57:500-515 (1984)). Here this was confined with cells 
expressing a2/pl or al/p3 subunits, which displayed larger GABA-evoked 
currents when clamped at positive potentials than at the corresponding negative 
values (FIG. 5B and FIG. 5C). In contrast, inclusion of the e-subunit with either 
subunit combination yielded currents which exhibit a linear relationship to voltage 
(FIG. 5B). The cause of outward rectification at GABA A receptors is 
controversial (Gray, R. and Johnston, D., J. Neurophysiol. 54:134-142 (1985); 
Segal, M. et al, J. Neurophysiol. 57:500-515 (1984); Bormann, J. et al, J. 
Physiol. (Lond.) 555:243-286 (1987); and Gage, P. W. and Chung, S. H., Proc. 
R. Soc. Lond. B255: 1 67- 1 72 (1 994)), and any of the proposed mechanisms could 
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be influenced by inclusion of the e-subunit. However, the e-subunit displays an 
unusually high density of basic residues close to the M3 segment (residues 376- 
40 1) which may affect rectification properties by sequestering chloride ions near 
the intracellular face of the ion channel. 

Having shown that e-subunits can assemble with other GABA A receptor 
subunits, its effects on receptor pharmacology was examined. Unexpectedly, the 
presence of the e-subunit had a pronounced effect on the sensitivity of GABA A 
receptors to intravenous anaesthetic agents. This was shown by both radioligand 
binding and by measurement of GABA-evoked currents (FIG. 6A-6D). First, in 
common with studies on other a/p subunit combinations (Davies, P. et aL, Br. J. 
Pharmacol 720:899-909 (1997) and Korpi, E. R. and Luddens, H. 5 Mol 
Pharmacol 44:87-92 (1993)), the anaesthetic steroid, 5a-pregnan-3cc-ol-20-one, 
was found to have a biphasic effect on [ 35 S]TBPS binding to cells expressing 
a2/pl subunits. In contrast, there was no stimulatory phase to the dose-response 
curve when using the a2/pl/e subunit combination (FIG. 6A). The functional 
correlate of this assay examined the effect of pregnanolone (10 nM to 1 |aM) on 
submaximal GABA-activated currents recorded from cells expressing a2/pl or 
a2/pl/e-subunit combinations (FIG. 6B). In the absence ofthe e-subunit, currents 
were potentiated by up to 521%, with a half-maximal effective concentration 
(EC 50 ) value of 53 nM. In contrast, currents recorded from the a2/pl/e subunit 
combination were relatively insensitive to the steroid. No significant effects were 
observed at concentrations up to 100 nM pregnanolone, and only minor effects 
(up to 73% potentiation) were detected at high concentrations ofthe steroid (FIG. 
6B). 

This effect ofthe e-subunit was examined further using different subunit 
combinations and additional anaesthetic agents. The ability of anaesthetic agents 
to stimulate [ 3 H]muscimol binding to cell membranes is a reliable indicator of their 
ability to stimulate GABA-evoked currents recorded from intact cells (Peters, J. 
et al, Br. J. Pharmacol. 94:1257-1269 (1988)). After transfection with cd/p3 
subunits, [ 3 H]muscimol binding to cell membranes was stimulated by up to 60% 
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in the presence of pregnanolone or the anaesthetic agent pentobarbital. However, 
when the e-subunit was expressed with this combination, the maximal stimulation 
dropped to below 15% (FIG. 6C). The dose-response curves for a third 
anaesthetic agent, propofol (2,6-diisopropylphenol), were similar to those of 
pentobarbital (data not shown). Consistent with the effect on ligand binding, 
inclusion of the e-subunit with different a/p subunit combinations blocked the 
ability of pentobarbital (100 pM) and propofol (3 |uM) to enhance GABA-evoked 
currents (FIG. 6D). 

High concentrations of anaesthetic agents can activate GABA A receptors 
in the absence of GABA (Whiting, P. et al, Int. Rev. Neurobiol 55:95-138 
(1995)). Notably, cells expressing a/p/e subunit combinations retained their 
sensitivity to this direct activation. Pregnanolone (10 (iM), propofol ( 1 00 jjM) and 
pentobarbital (1 mM) each evoked picrotoxin-sensitive currents recorded from 
cells expressing o2/pl/€ or ctl/p3/e cDNAs (0.2-2.1 nA, N= 4-6). The loss of 
sensitivity to the potentiating effects, but not the activating effects, of intravenous 
anaesthetic agents supports the position that the two responses are mediated by 
different anaesthetic binding sites on the receptor complex (Sanna E. et al, J. 
Pharmacol Exp. Ther. 274:353-360 (1995)). The 5 subunit has been reported 
to confer analogous properties with respect to anaesthetic steroids (Zhu, W. et al, 
J. NeuroscL 75:6648-6656 (1996)) but not barbiturates (Saxena, N. C. and 
Macdonald, R. L., J. NeuroscL 14: 7077-7086 (1994)). Future use of e- and 5- 
subunit fragments for chimaeric subunit construction may provide insights to the 
structural basis of different anaesthetic actions at GABA A receptors. 

It remains possible that the e-subunit can confer additional 
pharmacological properties to GABA A receptors that have yet to be uncovered. 
In view of its abundant expression in the subthalamic nucleus, identification of 
these properties could have important implications for the treatment of a variety 
of movement disorders, including Parkinson' s disease (Bergman, H. et al. , Science 
249:1436-1438 (1990) and Goetz, C. G. and Diederich, N. J., Nature. Med. 
2:510-514 (1996)). Its restricted expression pattern in the human brain also 
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provides an opportunity for relatively selective manipulation of this important 
nucleus of the basal ganglia. 

Methods 

Isolation of the e-subunit cDNA. Searches of the database dbEST 
(http://www.ncbi.nlm.nih.gov/dbEST/index.html) were made using the TBLASTN 
algorithm. The query sequence used was a consensus polypeptide containing 
amino-acid residues which are invariant between all known mammalian GABA A 
receptor subunits (FIG. 4). Oligonucleotide primers were designed from an EST 
sequence (GenBank accession no. R07883) to amplify 5' and 3' flanking sequences 
from cDNA libraries (Adams, M. et al, Nature (suppl) 377:3-174 (1995)) by 
anchored PCR. Amplification at 95 °C for 45 seconds, 60° C for 60 seconds and 
72° C for 2 minutes was performed for 35 cycles using the XL-PCR system 
(Perkin Elmer). Reaction products were purified from agarose gels and sequenced 
directly. The e-subunit open reading frame was amplified from a hippocampal 
cDNA library as described above using primers containing nucleotides 11-34 
(sense) and 1,569-1,592 (antisense) of the e-subunit cDNA sequence (GenBank 
Accession No. U66661 (SEQ ID NO:41)). The reaction products were ligated 
into pCDM8 (Invitrogen), and the resulting clones were sequenced to ensure that 
no mutations had been introduced. 

Northern blot analysis. Samples of approximately 2 jig poly(A) + RNA 
(Clontech) were electrophoresed on a 1 .2% formaldehyde agarose gel, transferred 
to a nylon membrane and hybridized with a 32 P-labeled fragment of the 3'- 
untranslated e-subunit cDNA (nucleotides 1,939-2,945 in SEQ ID NO:41). The 
blot was washed at 60°C in 0.1 X SSC, 0.1% SDS before exposure and then 
stripped of probe by boiling in 0.5% SDS, and rehybridized with a 32 P-labeled 
fragment from the 5 r end of the e-subunit cDNA (nucleotides 19-347 in SEQ ID 
NO:41). Finally, the same blot was stripped and reprobed with a 32 P-labeled 
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fragment of the human GAPD cDNA (Tokunaga, K. et al, Cancer Res. 47:56 1 6- 
5619 (1987)) (nucleotides 789-1,140 in SEQ ID NO:41). 

Transient transfection of subunit cDNAs. Transfection of human 
embryonic kidney cells (HEK-293) with subunit cDNAs was performed with a 
calcium phosphate-DNA precipitate in HEPES buffer. After incubation with the 
precipitate for 24 hours, the cells were washed and cultured for a further 48-72 
hours before use. The subunit cDNAs used for transfection were: human <x2 and 
pi, cloned in pCIS2; rat al, rat oc6, rat pi, human P3 and human e, cloned in 
pCDM8. 

Radioligand binding assays. Transfected cells were washed twice with 
phosphate-buffered saline and collected by scraping into an ice-cold solution 
containing 10 mM Tris-HCl, 1 mM EDTA, 100 mM NaCl, pH 7.5 (TEN). The 
cells were collected by centrifugation (5,000g, 10 min), hand homogenized in 
TEN, and a crude membrane fraction was obtained by centrifugation (30,000g, 3 0 
min). The membrane pellet was washed twice with TEN, and then resuspended 
in TEN at a protein concentration of ~3 mg ml -1 . The binding of [ 35 S]TBPS (20 
nM; 88.7 Ci mmol" 1 ; NEN) to membranes (50-100 ng protein) was assayed by 
filtration after incubation for 120 min at 25 °C in 1 00 ul of a solution containing 
20 mM Tris-HCl, 1 M NaCl, pH 7.5. Nonspecific binding was determined in the 
presence of picrotoxin (100 uM), and was equal to the binding of mock- 
transfected cell membranes (<0.03 pmol per mg protein). The binding of 
[ 3 H]muscimol (30 nM; 19.5Ci mmor 1 ; NEN) was assayed after incubation of 
membranes for 60 min at 0 °C in 100 pi of TEN. Nonspecific binding was 
determined in the presence of GABA (1 mM). The binding of [ 3 H]flunitrazepam 
and [ 3 H]flumazenil (each 1 -50 nM) was assayed essentially as previously described 
(Buller, A. et al., Mol. Pharmacol. 46:858-865 (1994)). All binding data were 
derived from the mean ±s.d. of three independent transfections. 

Electrophysiology. Cells were supervised (5 ml min 1 ) with a solution of 
140 mM NaCl, 4.7 mM KC1, 1.2 mM MgCl 2 , 2.5 mM CaCl 2 , 1 1 mM glucose, 
and 1 0 mM HEPES, pH 7.4 during whole-cell recordings. The electrode solution 
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contained 140 mM KC1, 2.0 mM MgCl 2 , 1 1 mM EGTA, and 1 0 mM HEPES, pH 
7.4. The electrode solution was supplemented with 100 \xM ATP to prevent 
current rundown in recordings from cells expressing a2/pl and a2/pi/e. When 
examining their agonist action, GAB A, glycine and anaesthetics were applied 
locally, by pressure (70 kPa) ejection from micropipettes situated 50 ^im from the 
cell. For modulation experiments, in which anaesthetics, bicuculline methiodide 
and picrotoxin were bath applied, GAB A ( 1 00 |uM) was ej ected for brief durations 
(5-30 ms) that activated approximately 10% of maximum current amplitude. 
GABA was applied for 1 second from low-resistance pipettes for concentration- 
response experiments (Adodra, S. and Hales, T. G., Br. J. Pharmacol 1 75:953- 
960 (1995)). Currents were filtered (1 kHz) and recorded at - 60 mV (unless 
otherwise stated) using an Axopatch 200A amplifier. Data were simultaneously 
digitized for storage on VCR tapes and the hard drive of an IBM PC. Currents 
were analyzed and reproduced using pCLAMP software (Axon). All 
electrophysiological data are mean ±s.e.m. 

It will be clear that the invention may be practiced otherwise than as 
particularly described in the foregoing description and examples. 

Numerous modifications and variations of the present invention are 
possible in light of the above teachings and, therefore, are within the scope of the 
appended claims. 

The entire disclosure of all publications (including patents, patent 
applications, journal articles, laboratory manuals, books, or other documents) 
cited herein are hereby incorporated by reference. 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: Hanna, Michael C. 

Kirkness, Ewen F. 

(ii) TITLE OF INVENTION: GABA A Receptor Epsilon Subunits 

(iii) NUMBER OF SEQUENCES : 4 6 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Sterne, Kessler, Goldstein & Fox P.L 

(B) STREET: 1100 New York Avenue, NW, Suite 600 

(C) CITY: Washington 
CD) STATE: DC 

(E) COUNTRY: USA 

(F) ZIP: 20005-3934 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS /MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.30 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: To be assigned 

(B) FILING DATE: Herewith 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/888,012 

(B) FILING DATE: 03-JUL-1997 

(C) CLASSIFICATION: 

(viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: Steffe, Eric K. 

(B) REGISTRATION NUMBER: 36,688 

(C) REFERENCE/DOCKET NUMBER: 1488 . 0950001/EKS/SGW 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (202) 371-2600 

(B) TELEFAX: (202) 371-2540 



(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6146 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 3872.. 4597 



r. ^3 

m 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 

GTCTTATAAT TGCTAAGCAC TTACAACTGT TTGCAGAGGA AACTGAGACT TTGTAACTAT 60 

GTCTCAGTCT CATCTGCAAA GAAGTAAGTG CTTTGCCAAG CTCCTTGAGA GGTTAGGTAA 120 

GTAGATAAAG TTCTGCTGCT GTCGGAATGT GCAGCTGGCT TTTTCATGCA GACCCTTCAG 18 0 

TTTCGAGGTT ACAACTCTGA CCTCTTTGGA TGACTTTGGG GAATGGAGCT CGTGTGAGTT 24 0 

CTCCATACCC AGAACCAATC CAGTCTGGTT GAATGGGAAG CAAAGTCCAT TGTAGTGGGA 300 

GGTGGAGGCT AGAGTTCTAA TGTCAGCTAG TTTAAGGCTG GGAAAGTCTG GAGGAAGTTA 360 

CAGCAGCTAC ACTGGCTGCT GCATTGACAT TTATCTTAAA GGAACAAGTC TGAAAAGCAC 420 

AGATTCTTAT CAAAGGCTTC ATGGTGGATT CCACATAGAC ATAGTGGCCA CTGGTTTTCT 480 

GACCTTTTCT CTGACAAAGA CTAAAGGGGA AGGTCCTGGG TATCTTACAC TTCAGCTCCC 54 0* 

AATTAGATGT GAGCACCTTC ACTTATGTTC CTAGGTGACC TGAATGAGGA GCCAAGGGAC 600 

CTCCCCAGGG TAGCTCCCAG AGCAACCCTG GAAACACTCT TCACACATCC TGACCAAGTT 660 

CAGGGCAGTG AAGGCACTGC CCTCATCGTT TCCAGAATGT GGATGGAGCC AGTCACCCAA 720 

CCAGCCATTT GTCGTGAGAG GCATCTTGTT CTGCTACCAT GTGACTAGGC AGAAAATCTG 780 

CTTTTGTTTC AT T TAT T GAG TCAGTCTCTG GATGAGGGAA AGCTCATGCT CATGTGGCTA 840 

GAGCTTTGCT TGCACAGTAT TAGGCAGGGG CAGAGGGCTG GGCTACCTTA AAAATACTTG 900 

CCCTTTTTCT TGGGGACTCT GGGGAAGCGG TTTTACTACC TTTGACTTGG GAGCCTTGCT 960 

CTTCTGCCAG CTAACCATGG GCCTGCCTCT TGGTTTTCTG CACCTCAGCT TTTCCCGGAT 1020 

AGGTGGGGAC CCATCATCAA AAGTGACAGA GAAGATAAGG CCCAGGGGCT TTCAAGTCAC 108 0 

TAGTGGTTCC GTTTAGTAGA TGATTGTGCA TTGTTTCAAA ATGGTGCCCT AG T G AC T AC A 1140 

AAGCCCCAGA GCCAGCATCA TCATCAAAGC AATGACAGTA GGTAAGCACC AGACCTCCTT 1200 

GGGAGTGAGG AGGATTCTTG AG G AG AAAAG AGGTCTTCTT TCTCCTCTGC TGGAGACTAG 12 60 

TTGATCTGGA GACGTGGTTC CTTCAATGTC AGAGTTATCT TTGGGACTGG TCTCAAACTC 132 0 

TTCCAGTTGG GCCCTGGGGC AGGTCTCTCC ATCTGGAGCA TACTTACGTG CTCGGCGATT 1380 

AAGGGTTCAG AATGCAGTGG TAGCCTGCTA CTCTGGCCAT CTTGGACCTT GATCCAGAGA 14 4 0 

ATCTCTGCTT CAGGAGCTTC TAAGAGAGTC CAGCCCTGCC TCCAGAGAGA GGCTTGCCCT 1500 

TCACTGATGG CTGTGGAGCC TCTGATGGAA TATTATTGCT GGTCAGGAAT TCACTGTCTT 15 60 

ACAAGGAGGT TTCCTTCTTC TCTAGACAGT TCTGTTCATC AAAAAACTCT CCCTGTTCTT 1620 

CTGAAATTGG AGTCTCTGGA AGTTCCACAC ATTAAGCTTA GTTCTTTTTC CTTGGAACTG 168 0 

TCCAGGTTAC ATTAGTCCAG CCACTGTTTC ACAGGACCGA GAT TAAACGA T CAACAT CAT 1740 

CATTCCCGGC AT G GAT CAT A GTCTGTTGTA GTCTACATAG CCCTAGTTTA TTTTTCTTCC 18 00 



CTTATTCTTC AAAGCTTTGG GTCCATTCAT 
TATTTAATTG TGTCCCTCTG ACACTGCAGT 
GAGTTTGAGC AGAAAACCAT CTTTAGCATA 
ATATATTGTT TTCCTTACCC GTGCTTCTCT 
GCATCTTTCT CTTGTCCTTC AAGACAAATT 
TTTTGGCCAG AATAAATAAA ATTAAAATAG 
CAACAAGAAT AT AT TAT ATA AACCCAGCAA 
AGTGTCTTAA ATTGCAGTCA AGATCCCACG 
CCAGTAGGCA AGTGCGAGGG TGGTCCGTGC 
TATACTCAAC ACACAGGTTT CCCACTGACA 
AATCTGAGAA GCTTTGCTCC TTGAGTTTCA 

CI 

yp CTTCTCATTC AGCTCCAGCC CACCCAGACC 

f!1 GGAGGACACA CGGACAGGTT AGTGTTCTGG 

2 GCAAGAGTTG CTGAGTGTCT GCTGGGTTCC 

ttj TAAGCCACAG GGCCTGCATG AGCCCTGCCT 

ft i 

s GGCCTAGGAA TTTTACAGTA TTGCAACTGC 

J! CTGGGCTCAG AGAAAAGCCC ACACCAGCCT 

fU GAGGGTGTGT CAGAGCATAA AGCAG CAT GA 

ys • 

yf|| ATAGGTTGTC ATCAGGTCCC GGAGGAGCCA 

ACACAATGTC CAAGTGACTG TGAGGACCTC 
CCTGACTTCT GGAGGGCTGA CTGAAATGTT 
AACACTCAGT TTGGGTTAGG GGTCATAGAC 
GGTGTAAAAT TGCAGGCCAT CTCAGGAAAT 
GGTGCGTGTG GAAAATGTCA GTGGGAGCCG 
AGAGTGCAGA TAGTCTGTAT AGCTCATGTG 
AGGCTAGTGC CGGTTTTGAC CACCTGTGGA 
T C AC AT GAAT GTGCAGAAAG GAGGCCAAAA 
CTTCCCTAGT CTCATTTGCT GAG AG TAG AC 
TGCCTCAGAG GGTATGAAGC CCTTAGTCAG 
ATCTAGAAGG CTAATTGATT TTTTCACTTT 
AGACATTGGG TGTAGGACAG TGAACTGAGA 
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TCTTCTAGTC 


CCAGTCCTCT 


GGACATGGTC 


I860 


GACCAACCAT 


GATCTGGTCA 


AAGAGGATAA 


1920 


TATTTTTTTG 


CTTTGGTTCA 


TCAGCCCCAG 


1980 


CACTCCTCAA 


GAAGAAGAAA 


GTGTGTGTTA 


2040 


GGCATCTCTT 


GACGAGCGGA 


GAAGGTTCTT 


2100 


AATCATCCAA 


CAGAATAATA 


AATCTTCGTG 


2160 


TTTTGCAGGG 


CCTGGGTATA 


ACTAATTAGA 


2220 


GCAAGAGGAC 


TTTTGATAAA 


TACATTCTGG 


2280 


AGCAGCTCTG 


GAGGAGTTCT 


ATCCCAAAGC 


234 0 


ACAGGTCGCT 


CCCTTGCCTT 


CTTCCAGAAG 


2400 


GTGCTGCCAA 


GGTGAGTACG 


AAAGGCTGCT 


24 60 


TGCTGGGCAG 


TTGATCCACT 


TTCCAAAATA 


2520 


TCTGCTTTAC 


AAAGCTGTTG 


CCTGACAGGA 


2580 


AGGCTGTTCT 


GAGCTTGGAT 


GGGCAGGGGC 


2 64 0 


TGAAGGGACT 


TAAAAGACGA 


CCTAATTATA 


2700 


AATGTGATGC 


TGAAAGTGGA 


AAATGATGTC 


27 60 


GGGAGTCATG 


AT AG C AGCAG 


AGTGCTTGGG 


2820 


ATGCTACAAA 


AGAAGATGCC 


AACTAGAGAT 


2880 


TGACCGTCTA 


GCTGAGAGCC 


ATGACCAAGG 


2 94 0 


AGTCTGCCCT 


GTGGATGTGT 


ATGCCACAGA 


3000 


CATTTTAAGC 


TTTTTCTTCT 


CTTTCCCTGA 


3060 


T AAG AC C AAA 


GAGTCCAGGG 


T T AG AAT C T T 


3120 


CTGTGAGCAG 


ATGGGATTGG 


CTTTGGGTAA 


318 0 


GGTCATGGTG 


GGCCTTTAGC 


ATCAGATTCC 


324 0 


AAACAGGGAG 


CCACCAAAAC 


TTTGGGGAGC 




GCAGTGCTCA 


CTCACGAAGG 


CATTTTGCPA 




GCATTCTGTG 


CTTCTCCACC 


ACAGCACAGA 


3420 


ATTCTGAGGG 


CCAGCAGTGC 


AGGTGTGATG 


3480 


CCATCTGGAT 


ATCAGCTGCG 


TGGGCATGAT 


3540 


CACCTGACTC 


TCTTGCCAAC 


CTGCAGAGAC 


3600 


AGGAAGCTAT 


TAAGATTCTG 


GCCTTGGCTT 


3660 
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AGCTCTCAAC TGGCCATTGG TCTTGCAGTA AGTCTTTTTT CTGGGCTTCT TCTGGTCCTA 

TTTGTATGTA TTGCATTGTC ACATCATGCC TCTATCCTAG GGAATACTGT GAGCTGAAAA 

ATGAGACCCT TACTGTTCAC GTCCTGCTAA GGGGGACCGT CGTGTCAGCA CTGTAATGCA 

GTGATGTTTT TTGTGTCTTT CAGGTGACTT C ATG GTC ATG ACG ATT TTC TTC 

Met Val Met Thr lie Phe Phe 
1 5 

AAT GTG AGC AGG CGG TTT GGC TAT GTT GCC TTT CAA AAC TAT GTC CCT 
Asn Val Ser Arg Arg Phe Gly Tyr Val Ala Phe Gin Asn Tyr Val Pro 
10 15 20 

TCT TCC GTG ACC ACG ATG CTC TCC TGG GTT TCC TTT TGG ATC AAG ACA 
Ser Ser Val Thr Thr Met Leu Ser Trp Val Ser Phe Trp lie Lys Thr 
25 30 35 

GAG TCT GCT CCA GCC CGG ACC TCT CTA GGG ATC ACC TCT GTT CTG ACC 
Glu Ser Ala Pro Ala Arg Thr Ser Leu Gly lie Thr Ser Val Leu Thr 
40 45 50 55 

ATG ACC ACG TTG GGC ACC TTT TCT CGT AAG AAT TTC CCG CGT GTC TCC 
Met Thr Thr Leu Gly Thr Phe Ser Arg Lys Asn Phe Pro Arg Val Ser 
60 65 70 

TAT ATC ACA GCC TTG GAT TTC TAT ATC GCC ATC TGC TTC GTC TTC TGC 
Tyr He Thr Ala Leu Asp Phe Tyr He Ala He Cys Phe Val Phe Cys 
75 80 85 

TTC TGC GCT CTG TTG GAG TTT GCT GTG CTC AAC TTC CTG ATC TAC AAC 
Phe Cys Ala Leu Leu Glu Phe Ala Val Leu Asn Phe Leu He Tyr Asn 
90 95 100 

CAG ACA AAA GCC CAT GCT TCT CCT AAA CTC CGC CAT CCT CGT ATC AAT 
Gin Thr Lys Ala His Ala Ser Pro Lys Leu Arg His Pro Arg He Asn 
105 HO 115 

AGC CGT GCC CAT GCC CGT ACC CGT GCA CGT TCC CGA GCC TGT GCC CGC 
Ser Arg Ala His Ala Arg Thr Arg Ala Arg Ser Arg Ala Cys Ala Arg 
120 125 130 135 

CAA CAT CAG GAA GCT TTT GTG TGC CAG ATT GTC ACC ACT GAG GGA AGT 
Gin His Gin Glu Ala Phe Val Cys Gin He Val Thr Thr Glu Gly Ser 
140 145 150 

GAT GGA GAG GAG CGC CCG TCT TGC TCA GCC CAG CAG CCC CCT AGC CCA 
Asp Gly Glu Glu Arg Pro Ser Cys Ser Ala Gin Gin Pro Pro Ser Pro 
155 160 165 

GGT AGC CCT GAG GGT CCC CGC AGC CTC TGC TCC AAG CTG GCC TGC TGT 
Gly Ser Pro Glu Gly Pro Arg Ser Leu Cys Ser Lys Leu Ala Cys Cys 
170 175 180 

GAG TGG TGC AAG CGT TTT AAG AAG TAC TTC TGC ATG GTC CCC GAT TGT 
Glu Trp Cys Lys Arg Phe Lys Lys Tyr Phe Cys Met Val Pro Asp Cys 
185 190 195 

GAG GGC AGT ACC TGG CAG CAG GGC CGC CTC TGC ATC CAT GTC TAC CGC 
Glu Gly Ser Thr Trp Gin Gin Gly Arg Leu Cys He His Val Tyr Arq 
200 205 210 215 
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CTG GAT AAC TAC TCG AGA GTT GTT TTC CCA GTG ACT TTC TTC TTC TTC 4 5 64 

Leu Asp Asn Tyr Ser Arg Val Val Phe Pro Val Thr Phe Phe Phe Phe 
220 225 230 



AAT GTG CTC TAC TGG CTT GTT TGC CTT AAC TTG TAGGTACCAG CTGGTACCCT 4 617 

Asn Val Leu Tyr Trp Leu Val Cys Leu Asn Leu 





235 




240 








GTGGGGCAAC 


CTCTCCAGTT 


CCCCAGGAGG 


TCCAAGCCCC 


TTGCCAAGGG 


AGTTGGGGGA 


4677 


AAGCAGCAGC 


AGCAGCAGGA 


GCGACTAGAG 


TTTTTCCTGC 


CCCATTCCCC 


AAACAGAAGC 


4737 


TTGCAGAGGG 


TTTGTCTTTG 


CTGCCCCTCT 


CCCCTACCTG 


GCCCATTCAC 


TGAGTCTTCT 


4797 


CAGCAGACCA 


TTTCAAATTA 


TTAATAAATG 


GGCCACCTCC 


CTCTTCTTCA 


AGGAGCATCC 


4857 


GTGATGCTCA 


GTGTTCAAAA 


CCACAGCCAC 


TTAGTGATCA 


GCTCCCTAAA 


ACCATGCCTA 


4917 


AGTACAGGCG 


GATTAGCTAT 


CTTCCAACAA 


TGCTGACCAC 


CAGACAATTA 


CTGCATTTTT 


4 977 


CCAGAAGCCC 


ACTATTGCCT 


TTGTAGTGCT 


TTCGGCCCAG 


TTCTGGCCTC 


AGCCTCAAAG 


5037 


TGCACCGACT 


AGTTGCTTGC 


CTATACCTGG 


CACCTCATTA 


AGATGCTGGG 


CAGCAGTATA 


5097 


ACAGGAGGAA 


GAGATCCCTC 


TCCTTTGGTC 


AGATTATTAT 


GTTCTCAGTT 


CTCTCTCCCT 


5157 


GCTACCCCTT 


TCTCTGCAGA 


TAGATAGACA 


CTGGCATTAT 


CCCTTTAGGA 


AGAGGGGGGG 


5217 


G C AG CAAGAG 


AGCCTATTTG 


GGACAGCATT 


CCTCTCTCTC 


TGCTGCTGTG 


ACATCTCCCT 


5277 


CTCCTTGCTG 


GCTCCATCTT 


TCGTCTGCAC 


TACCAATTCA 


ATGCCCTTCA 


TCCAATGGGT 


5337 


ATCTATTTTT 


GTGTGTGATT 


ATAGTAACTA 


CTCCCTGCTT 


TATATGCCAC 


CCTCTTCCTT 


5397 


CTCTTTGACC 


CCTGTGACTC 


TTTCTGTAAC 


TTTCCCAGTG 


ACTTCCCCTA 


GCCCTGACCC 


5457 


AGGCACTAGG 


CCTTGGTGAC 


TTCCTGGGGC 


CAAGAAACTA 


AGGAAACTCG 


GCTTTGCAAC 


5517 


AGGCATTACT 


CGCCATTGAT 


TGGTGCCCAC 


CCAGGGCACA 


CTGTCGGAGT 


TCTATCACTT 


5577 


GCTTGACCCC 


TGGACCCATA 


AACCAGTCCA 


CTGTTATACC 


CGGGGCACTC 


T AAC CATC AC 


5637 


AATCAATCAA 


TCAAATTCCC 


TTAAATTTGT 


ATGGCACTGG 


AACTTTGGCA 


AAGCACTTTT 


5 6 97 




k^l {jr\ 1 J. bb 






ACATCTTTAG 


GGCAGGATTC 


5757 


TTATCCCCAT 


TTTGCAGATG 


AAAACCCTGA 


GTCACAGATT 


TCTGTGGGAC 


TGTGGATCTC 


5817 


ACTGGAAGCT 


AT C CAAGAGC 


CCACTGTCAC 


CTTCTAGACC 


ACATGATAGG 


GCTAGACAGC 


5877 


TCAGTTCACC 


AT GAT T CTC T 


TCTGTCACCT 


CTGCTGGCAC 


ACCAGTGGCA 


AGGCCCAGAA 


5937 


TGGCGACCTC 


TCTTTAGCTC 


AATTTCTGGG 


CCTGAGGTGC 


TCAGACTGCC 


CCCAAGATCA 


5997 


AATCTCTCCT 


GGCTGTAGTA 


ACCCAGTGGA 


ATGAATTTGG 


ACATGCCCCA 


ATGCTTCTAT 


6057 


ATGCTAAGTG 


AAATCTGTGT 


CTGTAATTTG 


TTGGGGGGTG 


GATAGGGTGG 


GGTCTCCATC 


6117 


TACTTTTTGT 


CACCATCATC 


TGAAATGGG 








6146 



(2) INFORMATION FOR SEQ ID NO : 2 : 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 242 amino acids 

(B) TYPE: amino acid 
( D ) TOPOLOGY : linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Val Met Thr lie Phe Phe Asn Val Ser Arg Arg Phe Gly Tyr Val 
15 10 15 

Ala Phe Gin Asn Tyr Val Pro Ser Ser Val Thr Thr Met Leu Ser Trp 
20 25 30 

Val Ser Phe Trp lie Lys Thr Glu Ser Ala Pro Ala Arg Thr Ser Leu 
35 40 45 

Gly lie Thr Ser Val Leu Thr Met Thr Thr Leu Gly Thr Phe Ser Arg 
50 55 60 

Lys Asn Phe Pro Arg Val Ser Tyr lie Thr Ala Leu Asp Phe Tyr lie 
65 70 75 80 

Ala lie Cys Phe Val Phe Cys Phe Cys Ala Leu Leu Glu Phe Ala Val 
85 90 95 

Leu Asn Phe Leu lie Tyr Asn Gin Thr Lys Ala His Ala Ser Pro Lys 
100 105 110 

Leu Arg His Pro Arg lie Asn Ser Arg Ala His Ala Arg Thr Arg Ala 
115 120 125 

Arg Ser Arg Ala Cys Ala Arg Gin His Gin Glu Ala Phe Val Cys Gin 
130 135 140 

He Val Thr Thr Glu Gly Ser Asp Gly Glu Glu Arg Pro Ser Cys Ser 
145 150 155 160 

Ala Gin Gin Pro Pro Ser Pro Gly Ser Pro Glu Gly Pro Arg Ser Leu 
165 170 175 

Cys Ser Lys Leu Ala Cys Cys Glu Trp Cys Lys Arg Phe Lys Lys Tyr 
180 185 190 

Phe Cys Met Val Pro Asp Cys Glu Gly Ser Thr Trp Gin Gin Gly Arg 
195 200 205 

Leu Cys He His Val Tyr Arg Leu Asp Asn Tyr Ser Arg Val Val Phe 
210 215 220 

Pro Val Thr Phe Phe Phe Phe Asn Val Leu Tyr Trp Leu Val Cys Leu 
225 230 235 240 

Asn Leu 



(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 556 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS: double 

( D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA {genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 
TCCCCGGCTC CCCACCGCAG CAGCCGTCAC GTCGTCGGAG ATTTCCATCG GGGCGGGCCT 60 

GGGGCGGGGA GCGCGGGACG GGGCGGGGCG AGTGGGAGGA GTGAAAGTTG GAGCCCAGCA 120 

AAAGCCTCCG CCCCGCGCTC AGTGCGGCCA GAGCGTGAGC CGCGACCTCC GCGCAGGTGG 18 0 

TCGCGCCGGT CTCCGCGGAA ATGTTGTCCA AAGTTCTTCC AGTCCTCCTA GGCATCTTAT 24 0 

TGATCCTCCA GTCGAGGTGA GTCTCCATCC CGGGACCCGG GAGCCCTTCG CGCCCAGCTC 300 

CCTCTCCCCG GGAGCCGGGA CGGCTCCCGG GACCCCAGCG GCCCCGCGTT CCTCGAGCCC 3 60 

y§ CGCGCCCGCT TTGCCCCGGC CCTACCGCGG GCTGGCCGAG TCCCGCGTCC CCTCGATGCG 4 20 

Jjj CGCCGGCCTC GGCCCGCCTC AC TG TAG GAT GGGCTCCCGG GGTCCTTGAG GGGGAGCTCC 4 80 

jj| AAAAGGAAGA CAGGACGCCA GAAGGAAGAC GGGACTCCAG TTCGCGGATT CCCGCTCTCA 54 0 

W AAAGCACTGC GGTGGC 55 6 

f|J 

g*" (2) INFORMATION FOR SEQ ID NO : 4 : 

St (i) SEQUENCE CHARACTERISTICS: 

1H < A > LENGTH : 1097 base pairs 

■ { B ) TYPE: nucleic acid 

CP (C) STRANDEDNESS: double 

%£! ( D) TOPOLOGY : linear 

^ (ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

GGCACTAGCT GGGGCCCCTA CAGAGTGCAG GGCAGAGCTT CATTTTTCGT TTGAATGTTA 60 

TAGGGTCGAG GGACCTCAGA CTGAATCAAA GAATGAAGCC TCTTCCCGTG ATGTTGTCTA 120 

TGGCCCCCAG CCCCAGCCTC TGGAAAATCA GCTCCTCTCT GAGGAAACAA AGTCAACTGA 18 0 

GACTGAGACT GGGAGCAGAG TTGGCAAACT GCCAGAAGCC TCTCGCATCC TGAACACTAT 24 0 

CCTGAGTAAT TATGACCACA AACTGCGCCC TGGCATTGGA GGTGAGGAGC AGAACGACGT 300 

TCTTCCCCTC CTAGAGGGTC CAGGGGTTGA GGGCATAGGC ATGGAGAATG CACCTGGGCA 3 60 

GTAACAGAGG GTGCCATGCT CATGGACAGG AACATCTGCT ATTGACCTGT CAGGTAAGAG 4 20 

ATATTAACTC TATTCTCAGC AGTGTCATTG ACCTTGATCA AGACTTTTCC CTTCTCTCGC 480 

CCTCAGTTTT TCCAGTGGTA AAATGAGAGG ACTAAACTAG ATTGTTGATC TTCAAGATGT 54 0 
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GTGTCCAATT CTTAACAGTC CGTGAGCTTG GTTTTGCCAT GAAAGAATAA ATAAAGAAAT 600 

AGGATTAGAT GCTGAAACTG TGTGGTCCAA CACTTACTTG ACTCCCCTTT CATCCCCTCT 6 60 

GACCACTTCC TCCCCCGTCC CATGCGCCTG TTTGACACTT ACCCTCTGCT GCTTCTGCTT 720 

CCCTTATAGA GAAGCCCACT GTGGTCACTG TTGAGATCTC CGTCAACAGC CTTGGTCCTC 78 0 

TCTCTATCCT AGACATGGTG AGTACTAAGC TTTTTTAGTA CTATTTCCTA GCCCAGGGGC 84 0 

TGACCTATGG GACCTTCCAC AGACTTCTGC TTTCTGCTCT GTACTTCTGT AACAACTCCA 900 

ATAATTTATT TTCTGGAGGG AGAAAGGGAT TTTTTAACCA CTGGTTTGAG AATGAGACTG 9 60 

GAAAAGGTAA GTCCCTTGCT ACTTGAAGAG GATCTTCAGA AT CAT GAC C A TATCTTCCAG 1020 

TTTTTTCATT CAAAATAGAA ATAATAAAGC AATGTTAAAC AC CACAAT GG ACTGCCTCTT 108 0 

TTCTTCTGAT AATTATT 1097 

(2) INFORMATION FOR SEQ ID NO : 5 : 

■ifi (i) SEQUENCE CHARACTERISTICS: 
: p (A) LENGTH: 9631 base pairs 

Ul (B) TYPE: nucleic acid 

5 (C) STRANDEDNESS: double 

'jUj CD) TOPOLOGY: linear 

•Ly 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 



TGCATGATGT 


GAGGACCTAG 


CTCCCCTTGC 


TCACGGTAAT 


CAAGCAGGGC 


AT GAC T GAC T 


60 


TCTCATTTGG 


GATCATTAGG 


CTTGGTAACC 


TCTGAGTTCC 


CTTCTAGTTC 


TGACATTGGA 


120 


AGAGTCTTTG 


AGCAGAGAGA 


AGGGACCTCA 


TCTGACTCCT 


GCTTCCTGGT 


CTAGTGCTCA 


180 


ATGCATTCCA 


CCACATTGCC 


TTTAGGATTC 


TAGGTTAGGG 


AGTGGCAAAC 


AATATGTTTG 


240 


CATGTGAATG 


TCCTTTTTCT 


GTTCATCCCC 


AAGTATGTGC 


TTTTCTGTCC 


TTCCCACCAG 


300 


G AAT AC AC C A 


TTGACATCAT 


CTTCTCCCAG 


ACCTGGTACG 


ACGAACGCCT 


CTGTTACAAC 


360 


GACACCTTTG 


AGTCTCTTGT 


TCTGAATGGC 


AATGTGGTGA 


GCCAGCTATG 


GATCCCGGAC 


420 


ACCTTTTTTA 


GGAATTCTAA 


GAGGACCCAC 


GAGCATGAGA 


TCACCATGCC 


CAACCAGATG 


480 


GTCCGCATCT 


ACAAGGATGG 


CAAGGTGTTG 


T AC AC AAT T A 


GGTATGTCAA 


GCCTCTGGAG 


540 


TCTCACTTCC 


TGGAATTCTC 


TCTCCCCTTC 


TGATAATTTT 


AGCTAAAGAT 


CCATGGGCAG 


600 


AGATCTCATC 


C T G AAT GAT A 


CCTCTAAGGG 


CCTGTCCAGC 


TTTCCTAGAC 


CATGAGCTCA 


660 


GCCCCCTTAT 


GTAACAGATA 


TAGAGGCCTC 


AAAATAGAAA 


GATATTGCTT 


AAAGC CAC AC 


720 


ACCAAGTTTG 


TGGCAGAGCT 


GGAACTGGTA 


CTCAGTTACT 


TGGCTCCGAG 


TCCAGAGCTC 


780 


CCTCAACTAG 


GATGTGCCAG 


TATGACTGCA 


T TAT C TAG AC 


AATTCCATCC 


TACGTGGGCA 


840 



CO 

':■ , I 

n i 
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CTCGATACAA AGATACGTCC ACAGTGGTGG AATTGTTCAG GCAGAGCAGC AGCACGTAGT 900 

GGCAAAGGTA CCTAAGATCA AGTTGGATAC TTGAATTCCC AGCAGGGGAA GGTTGTGTGT 960 

GGGGATAGCA GGGAGGATGT TGGCAGTTCC TGGAAACTAG GGTGGGCGAG AAAACAAAAG 1020 

CCGATCGAAG TTGCTCCATA CGTTTCTCTA ATGATGGAGC CCAGAGTAAC CAGATACTTC 108 0 

TAAGCTGTTT GTTTGTTTTG TTTTGTTTTG TTTTGTTTTG TTTTGTTTTG TTTTCTCTCT 1140 

TGTTATCTCT CCTTTGAGCT TTTTGTCTTA AATTCTAGCG AGGTCCAGGC ACGGTGGCTC 1200 

ACGCCTGTGA TCCCAGCACT TTGTGAGGCT GAGGCAGGCA GATCACTTGA GGTCAGGAGT 12 60 

T C GAGACC AG CCTGGCCATC AT GGGAAAAC CCTGTCTCCA CTAAAAATGC AAAAATTAGC 1320 

AGGGTGTGCT GGCACTAATT CCAGCTACTC GGGAGGCTAG GGCATGAGAA TTGCTTGAGC 13 8 0 

CTGGGAGGCA AGAGGCTGCA GTGAGCTGAC ATCACGCCAC TGCCCTCCAG CCTGGGTGAC 14 4 0 

AGAGTGAGAC TCTGTCTCAA ACAAACAAAG AAAAAAATTG ACTCTGGCCA TTCATTGGTG 1500 

GTAGTCCCTA GACCAAAGCT GGGTGGATAC GGAAGTGCTT AGGGCCAGCC TGATGAGGCT 1560 

CCTTTCTCCC TTCCAGGATG AC CAT T GAT G CCGGATGCTC ACTCCACATG CTCAGATTTC 1620 

CAATGGATTC TCACTCTTGC CCTCTATCTT TCTCTAGCTG TGAGTACCTT CTTAAGTTTC 168 0 

TGGGGCCCCA GAAACATGCT GGGCTCCTTC TTTTTCTCAT CCTTGCCATT TACATTTTTC 17 4 0 

TGCCTCTGCT TTTCTTCTAA AATGCTGCCA AGGTTGTGCA GGACTTCCAT CCTCCACCCT 18 0 0 

CATTTCCTTT CCTGCCAACA ATACTGTGTT GCTCATCCCT TCCACGTGCC TCTGAAGCGT 18 60 

ATCTCAAGTA TGTCTGCTCC TCTCCATCTC CACTGGCACT ACCTTGGTTT AGGCCTTTGT 192 0 

TATCTTCCAC CTGGACTTTT GCCACATCTT CACTTTGAAA CTGCACATGT CCAAAATGAA 198 0 

ATTCATTGTC TCCTCCAAAC CTCTACCACC AAAACAAGTG TGTTGCTTCT GGGTTCCCAT 204 0 

CTGTCTCATT GAAGAGGACC ATCACTCACC CAGTTGCGCA AATCAAGAAC TTTGATGTTT 210 0 

CCTCTCCCTC ACCTCCTGCA TCTAATCAAT CAGCACATCC TGTTGGTGTT TCCTCCCAGT 2160 

CTCTATCGAT GCTGTCTATT TCTCTGCACC CTGTACAGCT TTGACTTCCA CCTGCATTAA 2220 

TTTAATTCTG CCTGGATTAC TACACTGGCC TCCTTGACAA CATGTTGTCC TCACAGAAGG 228 0 

AC C AG AG T G A CCTAGCTGAA GGGTCACCTA GGTTGGGTCA CTTCTTAGTC TCGAATCTGC 234 0 

CGTTAACTCT CATGGATCAA TTTGAAATTC CTTAGAATGA ACCTCAAGGC CAT T CAT GAA 24 00 

CTGGACCCTG CCACCCAATC CTGTGCACCT CATCCTCTGT GAGCTAGCCA TCCTGAACTT 24 60 

TTGTCCTTTC CACAATACAC CAGGTGTTTC ACCTTTCTAT ACTGCCCCTT AACCCCTTCA 2520 

ACCTCATTCT TATTGAGAAT ATT TACT TGA GTTTCAAGAT TTAATGGGAA TATCACCTGC 258 0 

TTTATGAAGT CTTTTCTGAG TATGTCCCCA AGTGACCTTT ATCTACTTTG TTTCCCCGCT 2 64 0 

GTTCTGTGGA CTTAGGTTTT TCAGAGCTCC TCCAAAAATC ACAGTAGTAT ACTCACTGTC 27 00 
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TTATAAAATT AAATGTGATT GCTTGAGGGT AGGGTTCATG CCTTGCTCAT CTCTGTATTT 27 60 

CTGGCCTAGG GCCTGATACT GAGGAATGCT CAGTAAACGC ACTCATTGAA TGGACTTCAA 2820 

CAATGAGGTA AGAGAGGCAA GGTCCCACAG CTGGTGAGGC CAGAGACAGG ACTCCAAGGC 28 8 0 

ATTGTGCAGG CTGAGTTCAT GCTATTGGAG ACCTCAGGTG GGCTTCCAAG TCTCATAGGA 2 94 0 

CCCTCTTTCT CACATTCCTT TCCAGTTTCC TATCCTGAGA AT GAG AT GAT CTACAAGTGG 3000 

GAAAATTTCA AGCTTGAAAT CAATGAGAAG AACTCCTGGA AGCTCTTCCA GTTTGATTTT 30 60 

AC AGGAGT GA GCAACAAAAC TGAAATAATC ACAACCCCAG TTGGTAAGCG TGCCAGGGCT 312 0 

TGGCGGAAGT CCAGGAAGGT GGTAGGGATG ATTGGAGATG GC CATC CAT A CAAATGCTTT 318 0 

GCAGTCATCC CGT GCAAACA TTGTAAGACA TGGCTCCTGT CTTATAATTG CTAAGCACTT 324 0 

ACAACTGTTT GCAGAGGAAA CTGAGACTTT GTAACTATGT CTCAGTCTCA TCTGCAAAGA 3300 

AGTAAGTGCT TTGCCAAGCT CCTTGAGAGG TTAGGTAAGT AGATAAAGTT CTGCTGCTGT 3360 

CGGAATGTGC AGCTGGCTTT TTCATGCAGA CCCTTCAGTT TCGAGGTTAC AACTCTGACC 3420 

TCTTTGGATG ACTTTGGGGA ATGGAGCTCG TGTGAGTTCT CCATACCCAG AACCAATCCA 34 8 0 

GTCTGGTTGA ATGGGAAGCA AAGTCCATTG TAGTGGGAGG TGGAGGCTAG AGTTCTAATG 354 0 

TCAGCTAGTT TAAGGCTGGG AAAGTCTGGA GGAAGTTACA GCAGCTACAC TGGCTGCTGC 3600 

ATTGACATTT ATCTTAAAGG AACAAGTCTG AAAAGCACAG ATTCTTATCA AAGGCTTCAT 36 60 

GGTGGATTCC ACATAGACAT AGTGGCCACT GGTTTTCTGA CCTTTTCTCT GAC AAAGAC T 3720 

AAAGGGGAAG GTCCTGGGTA TCTTACACTT CAGCTCCCAA TTAGATGTGA GCACCTTCAC 37 8 0 

TTATGTTCCT AGGTGACCTG AATGAGGAGC CAAGGGACCT CCCCAGGGTA GCTCCCAGAG 384 0 

CAACCCTGGA AACACTCTTC ACACATCCTG ACCAAGTTCA GGGCAGTGAA GGCACTGCCC 3 900 

TCATCGTTTC CAGAATGTGG ATGGAGCCAG TCACCCAACC AGCCATTTGT CGTGAGAGGC 3960 

ATCTTGTTCT GCTACCATGT GACTAGGCAG AAAATCTGCT TTTGTTTCAT TTATTGAGTC 4 02 0 

AGTCTCTGGA TGAGGGAAAG CTCATGCTCA TGTGGCTAGA GCTTTGCTTG CACAGTATTA 4 08 0 

GGCAGGGGCA GAGGGCTGGG CTACCTTAAA AATACTTGCC CTTTTTCTTG GGGACTCTGG 414 0 

GGAAGCGGTT TTACTACCTT TGACTTGGGA GCCTTGCTCT TCTGCCAGCT AACCATGGGC 4 200 

CTGCCTCTTG GTTTTCTGCA CCTCAGCTTT TCCCGGATAG GTGGGGACCC AT CAT C AAAA 4 2 60 

GTGACAGAGA AGATAAGGCC CAGGGGCTTT CAAGTCACTA GTGGTTCCGT TTAGTAGATG 4 320 

ATTGTGCATT GTTTCAAAAT GGTGCCCTAG T GAC T AC AAA GCCCCAGAGC CAGCATCATC 4 38 0 

AT CAAAG CAA TGACAGTAGG TAAGCACCAG ACCTCCTTGG GAGTGAGGAG GATTCTTGAG 44 4 0 

GAGAAAAGAG GTCTTCTTTC TCCTCTGCTG GAGACTAGTT GATCTGGAGA CGTGGTTCCT 4 500 

TCAATGTCAG AGTTATCTTT GGGACTGGTC TCAAACTCTT CCAGTTGGGC CCTGGGGCAG 45 60 



GTCTCTCCAT CTGGAGCATA CTTACGTGCT 
GCCTGCTACT CTGGCCATCT TGGACCTTGA 
AGAGAGTCCA GCCCTGCCTC CAGAGAGAGG 
TGATGGAATA TTATTGCTGG TCAGGAATTC 
TAGACAGTTC TGTTCATCAA AAAACTCTCC 
TTCCACACAT TAAGCTTAGT TCTTTTTCCT 
ACTGTTTCAC AGGACCGAGA TTAAACGATC 
CTGTTGTAGT CTACATAGCC CTAGTTTATT 
CCATTCATTC TTCTAGTCCC AGTCCTCTGG 
ACTGCAGTGA CCAACCATGA TCTGGTCAAA 
T TAG CAT AT A TTTTTTTGCT TTGGTTCATC 
Jff GCTTCTCTCA CTCCTCAAGA AGAAGAAAGT 

IK GACAAATTGG CATCTCTTGA CGAGCGGAGA 

Lif 

W TAAAATAGAA TCATCCAACA GAATAATAAA 

fff 

yj CCCAGCAATT TTGCAGGGCC TGGGTATAAC 

5 

ATCCCACGGC AAGAGGACTT TTGATAAATA 
2 s ; GTCCGTGCAG CAGCTCTGGA GGAGTTCTAT 

ffj CACTGACAAC AGGTCGCTCC CTTGCCTTCT 

ji GAGTTTCAGT GCTGCCAAGG TGAGTACGAA 

^ CCCAGACCTG CTGGGCAGTT GATCCACTTT 

TGTTCTGGTC TGCTTTACAA AGCTGTTGCC 
TGGGTTCCAG GCTGTTCTGA GCTTGGATGG 
CCCTGCCTTG AAGGGACTTA AAAGACGACC 
GCAACTGCAA TGTGATGCTG AAAGTGGAAA 
ACCAGCCTGG GAGTCATGAT AG C AG C AG AG 
CAGCATGAAT GCTACAAAAG AAGATGCCAA 
AGGAGCCATG ACCGTCTAGC T GAGAGC CAT 
AGGACCTCAG TCTGCCCTGT GGATGTGTAT 
GAAATGTTCA TTTTAAGCTT TTTCTTCTCT 
T CAT AG ACT A AGACCAAAGA GTCCAGGGTT 
CAGGAAATCT GTGAGCAGAT GGGATTGGCT 
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CGGCGATTAA 


GGGTTCAGAA 


TGCAGTGGTA 


4620 


TCCAGAGAAT 


CTCTGCTTCA 


GGAGCTTCTA 


4680 


CTTGCCCTTC 


ACTGATGGCT 


GTGGAGCCTC 


4740 


ACTGTCTTAC 


AAGGAGGTTT 


CCTTCTTCTC 


4800 


CTGTTCTTCT 


GAAATTGGAG 


TCTCTGGAAG 


4 860 


TGGAACTGTC 


CAGGTTACAT 


TAGTCCAGCC 


4 920 


AACATCATCA 


TTCCCGGCAT 


GGATCATAGT 


4 980 


TTTCTTCCCT 


TATTCTTCAA 


AGCTTTGGGT 


5040 


ACATGGTCTA 


TTTAATTGTG 


TCCCTCTGAC 


5100 


GAGGATAAGA 


GTTTGAGCAG 


AAAAC CAT C T 


5160 


AGCCCCAGAT 


ATATTGTTTT 


CCTTACCCGT 


5220 


GTGTGTTAGC 


ATCTTTCTCT 


TGTCCTTCAA 


5280 


AGGTTCTTTT 


TTGGCCAGAA 


TAAATAAAAT 


5340 


TCTTCGTGCA 


ACAAGAATAT 


AT TAT AT AAA 


54 0 0 


TAATTAGAAG 


TGTCTTAAAT 


TGCAGTCAAG 


54 60 


CATTCTGGCC 


AGTAGGCAAG 


TGCGAGGGTG 


5520 


CCCAAAGCTA 


TACTCAACAC 


ACAGGTTTCC 


5580 


T C C AG AAG AA 


TCTGAGAAGC 


TTTGCTCCTT 


5640 


AGGCTGCTCT 


TCTCATTCAG 


CTCCAGCCCA 


5700 


CCAAAATAGG 


AGGACACACG 


GACAGGTTAG 


5760 


TGACAGGAGC 


AAGAGTTGCT 


GAGTGTCTGC 


5820 


GCAGGGGCTA 


AGCCACAGGG 


CCTGCATGAG 


5880 


TAATTATAGG 


CCTAGGAATT 


TTACAGTATT 


5940 


ATGATGTCCT 


GGGCTCAGAG 


AAAAGCCCAC 


6000 


TGCTTGGGGA 


GGGTGTGTCA 


GAGCATAAAG 


6060 


C TAG AG AT AT 


AGGTTGTCAT 


CAGGTCCCGG 


6120 


GACCAAGGAC 


ACAATGTCCA 


AGTGACTGTG 


6180 


GCCACAGACC 


TGACTTCTGG 


AGGGCTGACT 


6240 


TTCCCTGAAA 


CACTCAGTTT 


GGGTTAGGGG 


6300 


AGAATCTTGG 


TGTAAAATTG 


CAGGCCATCT 


6360 


TTGGGTAAGG 


TGCGTGTGGA 


AAATGTCAGT 


6420 



GGGAGCCGGG TCATGGTGGG CCTTTAGCAT 
CTCATGTGAA ACAGGGAGCC ACCAAAACTT 
CCTGTGGAGC AGTGCTCACT CACGAAGGCA 
GGCCAAAAGC ATTCTGTGCT TCTCCACCAC 
GAG TAG AC AT TCTGAGGGCC AGCAGTGCAG 
TTAGTCAGCC ATCTGGATAT CAGCTGCGTG 
TTCACTTTCA CCTGACTCTC TTGCCAACCT 
AACTGAGAAG GAAGCTATTA AGATTCTGGC 
TTGCAGTAAG TCTTTTTTCT GGGCTTCTTC 
ATCATGCCTC TATCCTAGGG AATACTGTGA 
CCTGCTAAGG GGGACCGTCG TGTCAGCACT 

fl 

GGTGACTTCA TGGTCATGAC GATTTTCTTC 
rj TTTCAAAACT ATGTCCCTTC TTCCGTGACC 

W AAG AC AG AG T CTGCTCCAGC CCGGACCTCT 

Pi 

It! AGGCACATGG CTGGGAGTTG GCTGGGCCAG 

ACCAATGTCG CACATAGGTC AGGAGCTGAG 
y CCTCTGTTCT GACCATGACC ACGTTGGGCA 

i y 

PJ C C TAT AT C AC AGCCTTGGAT TTCTATATCG 

L ~f; TGTTGGAGTT TGCTGTGCTC AACTTCCTGA 

w CTAAACTCCG CCATGTATGA GCTGGGTATG 

ACATGCTAGC AAGGGTACTG GGGTTATGGC 
ATGTTACCCA GGGTGGTGGC GGGGTTGAAT 
AGGAGAGAAG CCAAGATATG GTGTGCCAAA 
TTAGACTTGG GTTATCTTCC CTTGACTTTT 
CGTATCAATA GCCGTGCCCA TGCCCGTACC 
CAT CAGGAAG CTTTTGTGTG CCAGATTGTC 
CCGTCTTGCT CAGCCCAGCA GCCCCCTAGC 
TGCTCCAAGC TGGCCTGCTG TGAGTGGTGC 
CCCGATTGTG AGGGCAGTAC CTGGCAGCAG 
GATAACTACT CGAGAGTTGT TTTCCCAGTG 
CTTGTTTGCC TTAACTTGTA GGTACCAGCT 
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CAGATTCCAG 


AGTGCAGATA 


GTCTGTATAG 


6480 


TGGGGAGCAG 


GCTAGTGCCG 


GTTTTGACCA 


6540 


TTTTGCCATC 


ACATGAATGT 


GCAGAAAGGA 


6600 


AGCACAGACT 


TCCCTAGTCT 


CATTTGCTGA 


6660 


GTGTGATGTG 


CCTCAGAGGG 


TATGAAGCCC 


6720 


GGCATGATAT 


CTAGAAGGCT 


AATTGATTTT 


6780 


GCAGAGACAG 


ACATTGGGTG 


TAGGACAGTG 


6840 


CTTGGCTTAG 


CTCTCAACTG 


GCCATTGGTC 


6900 


TGGTCCTATT 


TGTATGTATT 


GCATTGTCAC 


6960 


GCTGAAAAAT 


GAGACCCTTA 


CTGTTCACGT 


7020 


GTAATGCAGT 


GATGTTTTTT 


GTGTCTTTCA 


7080 


AATGTGAGCA 


GGCGGTTTGG 


CTATGTTGCC 


7140 


ACGATGCTCT 


CCTGGGTTTC 


CTTTTGGATC 


7200 


CTAGGTAAGA 


GGAGAAACAG 


GTATACGCAT 


72 60 


GGCAGAGTTG 


CCTTGTCATG 


GAGTCTTTTA 


7320 


CCCATCACTC 


TTGTGCTCTT 


GCAG G GAT CA 


7380 


CCTTTTCTCG 


TAAGAATTTC 


CCGCGTGTCT 


74 40 


CCATCTGCTT 


CGTCTTCTGC 


TTCTGCGCTC 


7500 


TCTACAACCA 


GACAAAAGCC 


CATGCTTCTC 


7560 


GGAGTGGTGG 


CAAGGCTTTG 


GAGTGTAGAG 


7620 


ACATGGGTGG 


TCAGCTTGCT 


GAGTGATGGA 


7680 


CAACTTCCTG 


ATGTAATGGT 


GAGAAGTTGG 


7740 


GACAGTTTCC 


AGAAAAT C C G 


GAGGCAGCAC 


7800 


CCCCACTTCT 


TTCCTTGTCC 


ATTTTAGCCT 


78 60 


CGTGCACGTT 


CCCGAGCCTG 


TGCCCGCCAA 


7 920 


ACCACTGAGG 


GAAGTGATGG 


AGAGGAGCGC 


7 98 0 


CCAGGTAGCC 


CTGAGGGTCC 


CCGCAGCCTC 


8040 


AAGCGTTTTA 


AG AAG T AC T T 


CTGCATGGTC 


8100 


GGCCGCCTCT 


GCATCCATGT 


CTACCGCCTG 


8160 


ACTTTCTTCT 


TCTTCAATGT 


GCTCTACTGG 


8220 


GGTACCCTGT 


GGGGCAACCT 


CTCCAGTTCC 


8280 
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CCAGGAGGTC CAAGCCCCTT GCCAAGGGAG TTGGGGGAAA GCAGCAGCAG CAGCAGGAGC 834 0 

GACTAGAGTT TTTCCTGCCC CATTCCCCAA ACAGAAGCTT GCAGAGGGTT TGTCTTTGCT 8 4 00 

GCCCCTCTCC CCTACCTGGC CCATTCACTG AGTCTTCTCA GCAGACCATT TCAAATTATT 8 4 60 

AATAAATGGG CCACCTCCCT CTTCTTCAAG GAGCATCCGT GATGCTCAGT GTTCAAAACC 8520 

ACAGCCACTT AGTGATCAGC TCCCTAAAAC CATGCCTAAG TACAGGCGGA TTAGCTATCT 85 80 

TCCAACAATG CTGACCACCA GACAATTACT GCATTTTTCC AGAAGCCCAC TATTGCCTTT 8 64 0 

GCAGTGCTTT CGGCCCAGTT CTGGCCTCAG CCTCAAAGTG CACCGACTAG TTGCTTGCCT 8 7 00 

ATACCTGGCA CCTCATTAAG ATGCTGGGCA GCAGTATAAC AGGAGGAAGA GATCCCTCTC 8 7 60 

CTTTGGTCAG AT TAT TAT GT TCTCAGTTCT CTCTCCCTGC TACCCCTTTC TCTGCAGTTA 8 820 

GATAGACACT GGCAT TATCC CTTTAGGAAG AGGGGGGGGC AGCAAGAGAG CCTATTTGGG 8880 

ACAGCATTCC TCTCTCTCTG CTGCTGTGAC ATCTCCCTCT CCTTGCTGGC TCCATCTTTC 8 94 0 

GTCTGCACTA CCAATTCAAT GCCCTTCATC CAATGGGTAT CTATTTTTGT GTGTGATTAT 90 00 

AGTAACTACT CCCTGCTTTA TATGCCACCC TCTTCCTTCT CTTTGACCCC TGTGACTCTT 90 60 

TCTGTAACTT TCCCAGTGAC TTCCCCTAGC CCTGACCCAG GCACTAGGCC TTGGTGACTT 912 0 

CCTGGGGCCA AGAAACTAAG GAAACTCGGC TTTGCAACAG GCATTGCTCG CCATTGATTG 9180 

GTGCCCACCC AGGGCACACT GTCGGAGTTC TATCACTTGC TTGACCCCTG GACCCATAAA 92 4 0 

CCAGTCCACT GTTATACCCG GGGCACTCTA AC CAT C AC AA TCAATCAATC AAATTCCCTT 9300 

AAATTTGTAT GGCACTGGAA CTTTGGCAAA GCACTTTTGA CAAGTTGTGT CTGATTGGAG 9360 

CTTCATGATA GCCTTGTGAC ATCTTTAGGG CAGGATTCTT ATCCCCATTT TGCAGATGAA 9420 

AACCCTGAGT CACAGATTTC TGTGGGACTG TGGATCTCAC TGGAAGCTAT CCAAGAGCCC 94 80 

ACTGTCACCT TCTAGACCAC ATGATAGGGC TAGACAGCTC AGTTCACCAT GATTCTCTTC 9540 

TGTCACCTCT GCTGGCACAC CAGTGGCAAG GCCCAGAATG GCGACCTCTC TTTAGCTCAA 9600 

TTTCTGGGCC TGAGGTGCTC AGACTGCCCC C 9631 
(2) INFORMATION FOR SEQ ID NO : 6 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7479 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
- (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 
TTCTACCAGA CCTGGTATGA TGAGCGCCTC TGTTACAATG ACACCTTTGA GACACTTATT 



60 
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CTACATGGCA ACGTGGTGAG CCAGTTGTGG ATCCCGGATA CTTTTTTTAG GAATTCTAAG 12 0 

AGGACCCAAG AG TAT GAT AT CACCATACCC AACCAGATGG CTCTCATCCA TAAGGATGGA 18 0 

AAAGTGTTGT ACACAGTTAG GTATGTCAAG CCTTTAGTGT CTCACTTTCT AGGGCTCTCT 24 0 

CACTCTTCCA GAAAATTTTA GCTAGGGACT CCTAAGTAAA TATATCATTC TGAATAATAT 300 

TCCTAAAACC CTGTGCAGCT TTTCTAGATC AGGAATTCAA CCTCTTCCTG TGCAAATATA 3 60 

AGACACTGAG AT AT T AAAT A AAAGATTTTA AAGATGCATG CCAAGTGTTT TGGTAATCAT 42 0 

GGAGCTGTAT TTGCCCCACA GTCTAGAAGG TAGGTGCTGG CCTGACTGTA TTAGTAGATA 480 

ATTCCATCCC AAATGGGCAT AG TAT T T AAA GACACGCCCA C AG T GAC AG A ATTGTCCAGG 54 0 

CACAGTATAA ATACATCGGT GGAAAAGTAT TCAGGACTAG GATACATAAA TTGAATGCCT 600 

GGAAGGAGAT GCTTGTGTCA GCAATATGGG ATGTGGAGGT GGTGCAAAGT AGCACAGGAC 660 

TAGAGCACTG C AC G AAC AAG AAAGAAAAAT CAATCTAGGC TGGCCCATGT ACTTCTTTGA 72 0 

TGATGTAGCC TAG TAG TC AG ATTTTAGAAG TTCTTCTTCC TCTCTTTGTA TCATTCCTTT 7 80 

GGGCTTTTTG TCTTAAATTC TACTAATGGC AC T CAT T AGA TATGATACCT GGTCAATGTT 84 0 

GAGTAGATAT CAGTGCAAAG AGCCAGCCTA ATGAGGCTCC TTTCTCCCTT CCAGGATGAC 900 

CATTGATGCA AGATGCTCAC TCCACATGCT CAATTTTCCA ATGGATTCTC ACTCTTGCCC 960 

TCTGTCTTTC TCTAGCTGTG AGTACCTTCT TAGGTTTCTG GGGATCCAGA GATATGCTGG 102 0 

GCCCCACTTT TGCTCATCTT TGCCTTTTAC ATTTTTCTGC TTCTGCTTGT TTCCTGTGAT 1080 

GGTGCCAGGG TTGCTATAAG CCTCCATTTT CCATCATCAC CCCTTTTCCT GTAAGCAGTG 114 0 

CTGTGTGTGT ATTGCTTTGG CTTTATGTTG AATTCTTCCC TCACCCTTGC TCTACCTTAA 1200 

GCTAGTCACC TGCTTCTGAA GTATATCTGA AGTATGCCTG CTCCTCTCCA TCTCTGCCTT 12 60 

GATGCAGGGC TTTAATCATC TTCTGCTTGG ACTTCTACAG TACCTTCACT TTGAAACAGA 132 0 

GTGTGTCTTA AGCTAAATTC ATTTTTTTCT TCCCAACACC TGCCTCCAAA GCAAATGTGT 1380 

CGCTTCTGTG TTGCCTCATT GAAGGGTGCC ATCATTTACC CAGCTACCCA AGCCAAGAAC 14 4 0 

TCTGACAGCC CCATTCATTG ACCTCCTACA TCTACTCAGT TCACACATCC TGTTGGTGTT 15 00 

CCCTCCAAAT CTCTATTGAT CTCACCTCCA TCTCTGCATT CT GAC TAT TA TTAGCTTTCA 1560 

TTCCTCATTT CAAATGTGTT CTTTGCTCGC CTGGATCAAT TCTAAATGCC TTAAAATGAA 1620 

CCTCAAATCC CTTCATGAGC TGACCCTGCT GCTTCCTGCT ATACAACCTA TCTGTGAAAT 168 0 

AGTCTCCCTA AAGTTTTGCT GTTTCCACAT TCACCTTTCT GTGCTTCCCT TTCACCTTCT 17 4 0 

TCAACCTCAT CTCTATTGAG AATGTTTATT TTCAAAACTT AAAGTGAATG CCCCATGCTT 180 0 

TAGGAAGTGG GTCCCAAGAG ATCTTTCCCT ATTTTGCCAA TTGTTCCATG GGTAGAGATT 18 60 

TTTCAGAGCA ACCCCCAAAC TCACAATTGT ATATGGCCTT ACTCAATGCA ATGTGAATGC 1920 



TGGAGGGTAG GTTCACGCCT TACTCATCTT 
TTGAATGCAC TTCAACAACT GGGCCAAAGG 
AAAAGTCCAA AACATGGCTC AAGCTAGCAC 
CAGTCTCATT CATTCCTCTT TTCCACATTG 
ATATACAAGT GGGAGAATTT CAAACTCAAA 
GAGTTTGATT TTACAGGAGT GAACAACAAA 
TGTGCAGTGG GAGCTAGAGC TGGGCAGAGT 
GCTATTTGTA CCAATGCTAT GTAAAGAGCT 
CTCTCATATA TGCTAAACAC TTACAACTGT 
GTCTCAGTCT CATCTGCAAA GAAGTAAGTG 
GTAGATATAT GTTTGTAACC TTCAGTATGT 

uQ ■ TATTTAATTT CTTTCTTATA TCACTGACTA 

O 

s„"i TGTGATTTTA CCAAATCCTG AACCAGGTCA 

^ AGCAGGAGGT AGAAGTGGAA ACTAGAGTGA 

lU GCAGATGGTT ACTCAGGTGC AATGGCTACA 

ry 

3 ' GAGTACATAT TTTCACCAGA GGCTTCAGAG 

Sf! TGATCTTGTT CCTGACAAGG ATAATAGGGG 

2 LI 

pJ CAAATAGATG TGAGCATCTT TGCTTGTATT 

y 3 

tf| TCTTCTCAGG GTACCTCTGA GAGCAACCCT 

TGGATGGATC AGGTCACCCA GCCAGACTTT 
TGATGACATA AAAAAATCTG CTATTGCTAC 
CTAACCCAAC TAAAGCTTTG CTTATACAAT 
TTCCTGGGAC TCTGAGAAAG CCTTCTTGCT 
T AC AT AAAT G TGGGCCTGCC TCTTAGTATT 
TCAGAAGTGA CAGAAGAGAC AAGGTTCAGG 
TAGATGGTTT TTGCATTGTT TCAAAATGGT 
CAT CATC AG T GAAGCAATGT CAGTAGGTAA 
GGTCTTCTTT TTTCATTGAC T AGAGAC T AC 
AGTTATTTTT GAGTCAGGTT CCAACTCTCC 
TAGAACACAC AT G T AC T CAT TTTTCAGGGT 
CATCTTGGAC CTTACAGCAG AGTTTCTGCT 
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TGCATCCGAA 


GCCCAGGGCA 


AATGTACTTA 


1980 


CAGGGTCTCA 


AAGCTGGTGA 


AGCCAAAGTC 


204 0 


CATGCTTGTG 


GGGCCCTCAA 


CTGGGTTTCC 


2100 


TTTTCAGTTT 


CCTATGATGA 


GCATGAGATG 


2160 


ATCGATGCGA 


AGAACACTTG 


GAAGCTATTG 


2220 


AC T GAAAT C A 


TCTCCACCCC 


AGTTGGTAAG 


2280 


CTGAGGGTGT 


AGTACCAACA 


ACTGGAAGTG 


2340 


TATGGAAACA 


CTGTCTAGGA 


CATCGCTTCT 


2400 


TTGCAGAGGA 


AACTGAGACT 


TCATAGCTAT 


24 60 


CTTTGCCAAG 


CCCCTTGAAG 


GACGGGGTAA 


2520 


TAATCTACCT 


TTTGAGTTTT 


AGATCTTTTG 

nvjii x v_x x x x x \_J 


258 0 


ACTTAGACAA 


CATTGGGGAG 


GGGTACTCTG 


9 fi4 0 


AGTTGGTTGA 


GTGGGAAGGA 


AG AC C AC C C T 


2 7 0 0 


CAATGGCGGC 


TAGTAGGGAT 


TGAGGAGTCT 


91 fin 


GTAATATTAT 


CTTAAAGGAG 


CAACTGTGAA 


2820 


TGGAATCAAC 


ATAAACATGT 


TTTTTCCTfT 

X X J- X -L \j X \jVj 


28 8 0 


CAGATTCTGG 


GTCTCTTAAC 


TTGTATCTCC 


9 94 n 


CCTAGGTGAT 


CTGAATGAGG 


AGCCAATAAA 


?nnn 

o u *j u 


GGACCTTCTT 


CACATGTACT 


GACAAGA ATG 


sn fin 


TGACATTTAT 


TTTGCTTTGC 


TTCTGTGTGA 


3120 


TGTTGGTCAC 


TCGGT GAGGG 


GCAGCTTATG 


ftn 

o J- o u 


ACTAGGCAGG 


GATGGGTGCT 


GCTTGTCCTG 


324 0 


TGCTTTGACT 


TGGGGTTCTT 


GTTCTTCTTC 

^ X X >^ X X ' X X \s 


3300 


CAGTTTGTCC 


AGATAGGTGG 


AGCCTCATCA 


fin 


GGCTTTCAAG 


TCACTAGTGG 


X X W VJ XXX 


^4 9n 


GCCCTAGTGA 


CTACAAAGCC 






GCAACATCTC 


TTACTGTGGA 


GAAAGAAATA 


3540 


TTATTCTGAA 


TTCATACTGT 


CTTCACTGTC 


3600 


CAGTTTGACC 


CTGGGGCACT 


TCTTTCTAGC 


3660 


T T AAAT GC AG 


CAGTGCCCTG 


CAACTTGGGC 


3720 


TCAGGAGCTT 


CTCAGAAAGT 


GCAGCCTTGT 


3780 
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CTGTAGTAAG 


TAAACCATCC 


TTCACTGGTG 


GCTGTTGAGC 


CTCTGAGGAA 


ATCTTGCCAC 


3840 


GATCAAGAGG 


TCAGCATCTT 


ACAAGGAGAT 


TATTTTCTTC 


CCTAGACGTC 


TCTTTCCATC 


3900 


AGAAAATTCT 


TCTTGCTCAT 


CTAAAATGGG 


AGTTTCTGGA 


ACATTTACCC 


ATCAGATCTA 


3960 


GTTCTTTTTC 


CCCTATAACT 


ACCCAGTTTA 


CATTAATTTA 


AGGTACTCTT 


TCAAAAGTCA 


4020 


AATGTAAAGA 


TCTAGGGTTA 


ACATTAGAGC 


ACACATTTGT 


ATTCTATTGG 


GACTCTCCTC 


4080 


CCTAGTTCCT 


TTTCTTCTCT 


TGTTCTTCAG 


TTCACTAGTC 


CTCTGGGCAG 


TCTTTTAATT 


414 0 


GTGTCTTTTG 


ACAGTGCAGC 


CATCAACTAT 


GATCTGGACA 


AAGCAGGTAA 


GAACCAAAGT 


4200 


CATCTTTAAT 


CCATTTTGTT 


TTGGTTTGGC 


TTTCAGTCCT 


AGATACCTCT 


CTTCCATTGG 


4260 


TATTGCTTCT 


T CAAT CAT C A 


GAATGAAAGA 


AACACTTATT 


AGTATCCTTC 


TCTTGTCCAA 


4320 


TTTCCAGCTT 


TTAGTAAATA 


AAACATTTCT 


TCTTTAATCA 


GACTGACTAA 


AATTTAAAGA 


4380 


AT G AG AAAC C 


CAACAGCACA 


ATGCATCTTG 


TG AC AT AC AT 


ATTAAGGGAG 


TTTTTAATCA 

X _L X X X ilUi. x on 


4 4 4 0 


CAGTCAAGTC 


TCCAGGACAG 


AAAGAATAAT 


AATTTGGTCA 


TTCTGGCTAG 


TAGCATAGAA 


4500 


TGGTTTCTGG 


TGCAGTGCCG 


GAGGCATTCT 


AGTATTCAGG 


CAGGAAGTTT 


CCCACTGACC 


4560 


ATAAGCTACT 


TATTTGCCTT 


TCTCTGAAGA 


ACCTGAGAAG 


CTTTGTTCCT 


CCAAGGT CAA 


4 620 


GTCTCCTAAC 


ACTAGGAGGA 


AAGGTTGCTC 


TCGATCTTCA 


CGGCCCACTC 


AGACTTGCCA 


4 680 


GGTACATGGT 


CTGTTTTCTG 


TATAGTCTAC 


AGTCTAGACA 


TTTAGTGTTT 


ATGTCTGCCT 


474 0 


TCCAAGGTTA 


TTTTGACCAG 


GGGTATCAAA 


AGTCACTGAG 


CACCATTGTT 


TCCCATGTTA 


4 800 


TGCCAGGCTT 


GGTTGGCTAG 


GGGCTATGTT 


AAAGGCTTTA 


TATGAGCCCT 


TCCAGGCAGG 


4 860 


AGCTTGAAGG 


TCCATTTAAT 


TTAAGGTCAC 


AGGTGGCTGT 


TGCATCACAG 


AACTGGAAAT 

xt x vj ^wJiJXX£i X 


4 920 


TGAGGGCTTA 


ACTTCACAGA 


AAAAT C CAT G 


G T T G AG AC AA 


TTTAATGTCT 


AATGACATGG 


4 980 


AGAGTAAAAG 


TGATCCTCTG 


GGCAAACTAA 


GCATGGAGTC 


ATAAT AT TAT 


GAAGAGCATT 


5 04 0 


AGAAGCAACA 


TTAGGAACAG 


AATATGCCAG 


CTGGAAATAC 


AGGCTTGATC 


AACTTCTGAA 


-J -L KJ \J 


AGATCCATTT 


CTATCTAAAG 


TAAATACATG 


AGAAGGTCAT 


TATACCCAAG 


TGAATGTQAA 


si &n 

J1DU 


GGCCTACACT 


GCCTTCCGTC 


TGAATCTTTT 


GCAAACCTAA 


CATCTAAAGC 


TTGACTGACA 


522 0 


AGTTCATATT 


CATCTACTTC 


TATTTTCCTA 


GGACATTGAA 


TTTAGGTTAA 


GCAGTAATAH 


S ? R 0 

■*J O \J 


AT C AAGAAGA 


GGGAGTCCAG 


GATTTGCTTA 


CCAGTATAAA 


TATATGTGGT 


ATCTAAACTG 


534 0 




<~>£\JA X 1 ±±\j 111 


1 Abb I CjjAGGC 


ATTCATGTTG 


TATGTGTGCA 


TAT G TAT GAG 


5400 


TCAAATCCAG 


TTAGGTCTTA 


GAATCAGAAT 


ACATAGTGTA 


G AC AC TAT T A 


TGTAGCTCAT 


5460 


ATATAGCAGA 


AT AT GT CAT A 


TTTGGCTAAC 


CCAAGTGGGC 


AT GC TAT GAG 


GACTAACAGG 


5520 


GCAGATAATG 


TTGTGACATC 


AC AAAC TATA 


CATTGCATTT 


CTGTGGGTCT 


CAGCTAGCCA 


5580 


TCAGCTATTA 


GGTTTTAATA 


C CAT GAAAG T 


AAAGGTTATG 


GGGTTGTTTC 


TTGTTTGTTT 


5640 
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GATTTGTTTT 


GAACTTTTCT 


CTCACTACTG 


CTTCTCTGGG 


AAAAACAGAT 


TTTGGGAGAA 


^7 on 


TGAATCAGTT 


AACTGAAAGT 


AGGTTTATTA 


ATACTCTTGT 


ACCAGCTTAT 


CTCTCAATTA 


57 60 


GCAATGGGTC 


TTACAGTAAA 


TATCTGAGTT 


TCTACTTTGC 


TGCACTGAAA 


TACTGTGCCT 


5820 


TTATTATAGA 


ACTTTGAAAG 


GGTTTCCCAT 


GAGCTGGATG 


GTGGGAGCCA 


CACTATATGT 


588 0 


GCCATGCTGG 


ACATCAGGCC 


AGTACTGCAA 


TGCAGTGGTG 


TATTTGATGT 


CTTTCAGGTG 


S 94 f) 
J u 


ACTTCATGGT 


CATGACATTC 


TTCTTCAATG 


TAAGCAGGAG 


GTTTGGCTTC 


ATTGTCTTTP 




AAAAC TAT AT 


CCCTTCATCT 


GTTACCACAA 


TGCTTTCCTG 


GGTGTCCTTC 


TGGATCAAGA 




TAGAAGCTGC 


TGCTGCCAGG 


GCCTCTGTAG 


GTAAGAAAGT 


ATGGTGATCT 


TAAATGTGAT 




CATATGTCAT 


AGATTAACAT 


TGCCTTCTCC 


TGTTGTCCTG 


CAGCACTGTC 


AGGAGCTTAG 


UlOU 


CCCACCTTCA 


CTCCTTGCTC 


TTATAGGGGT 


CAGTTCTGTG 


CTCACCATGG 


CCACACTGGG 


u y u 


TACCTTTTCT 


CGTAAGAATT 


TCCCTCGTGT 


CTCCTATCTC 


ACAGCTTTGG 


ACTTCTATAT 

-^■w X X x n X rl X 


UJUU 


TGCAATTTGT 


TTCGTCTTGT 


GCTTCTGTAC 


TCTACTAGAG 


TTCACTGTGC 




OjOU 


GACCTACAAT 


AATATTGAAC 


GACAGGCTTC 


TCCAAAGTTC 


TACCAAGTAA 


GAACCACGTG 


642 0 


TTGGTATGGG 


AAGCAAAACT 


T TAG AC T GAA 


GATAGGAGGG 


TTATGGCACT 


TGTGTGATTA 


f>& ft 0 


CCTTTCCAAG 


TGATGAGAAT 


TTCCCTGGGG 


TTATAGAGGA 


AATAAACTTG 


^ ± vJ J. Orl x Jr\ x O 


£ r ^ n 

D O ft U 


CAGGAAGGCC 


TAGAACTGAT 


GTGGAGAGTG 


TAGTTTCTAG 


GAAATTCTAG 


GCAACTTTTr 

\J V^iTUTiN^- X X X X w 


0 u u u 


TTCTTGACTC 


ACTTTCCCTT 


GACCATTTTG 


TTATCTTTAT 


TTTTCTATTT 


CAGTTTCVAA 


w U U U 


CCAATAGCCG 


TGCTAATGCA 


CGTACTCGTG 


CTCGTGCCCG 


CACTCGTGCT 


CGTGOTCGTG 


£7 9 f) 


CCCGTGCTCG 


TCAGCAGCAG 


GAAGTGTTTG 


TGTGTGAGAT 


TGTTACCTAT 


GAGGAAAATG 


{^7 fi n 


CTGAAGAGGG 


TTACCAGTGG 


T C T C CAAGAT 


CAAGAAGACC 


TCAGTGTCCC 




0 0 *± u 


GTGGCCGAAG 


CTATGTGTGC 


TTCAGGGTTC 


T CAGGAAGT A 


TTTCTGCATG 

x x x x \j x \j 


GTTrT'Tf^-f^TT 


D^UU 


GTGAGGGCAA 


CAGCTGGCAG 


CGGGGCCGCA 


TCTGCATCCA 


CGTTTATCGG 


\_. l bb/i x ±\£\K_, l 


£ Q (Z r\ 


ACTCGCGGGT 


GCTTTTCCCC 


ATTACATTCT 


TCTTCTTTAA 


TGTGGTCTAG 


luuijluril X 1 


Toon 
/ uzu 


GCCTTAACCT 


GTAGGCTCCA 


GCTGGTAGCT 


CATGGGGCAA 


TCACCTCAGT 


-i- K^J-$\j \jJr\\j 


/ u 0 u 


GTCCTAAGCC 


CCTTTGTCAA 


GGGAGTTGGG 


AG AC AAT AG C 


AACAGCAGCA 






GAGTTTGTCT 


TTGCTGCTCC 


TCCTTTTGTT 


GGCCCCTTCA 


CTCAATCTTT 




/ z U U 


TTCTCAAGTA 


ACAGCCCACC 






TCCATGATGT 


TCAGTGTGCC 


7260 


CACAATTGAG 


CAGATAAGCT 


ATCTCTTGGT 


TGTGTTTGCA 


GTTACTGTTT 


TTCCCCAGGA 


7320 


GGTCATGGTA 


CCATTTGTGG 


TGCTAAGGGG 


CCCAGCTCAA 


GCTCAGCCCT . 


AGAATGCACA 


7380 


GAGCAACAAC . 


AGTATAGCAG . 


AAGGAAGGTC 


CTCATCCTTA . 


AGTCTTTAAG 


TCAGGCTATT 


7440 


TGATTTTTTC 


TCCCTGCTCT ■ 


CTCTTCCTCT , 


ACACACACA 






7479 
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(2) INFORMATION FOR SEQ ID NO : 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 
CGCGACCTCC 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8 
TCCAGTCGAG 

(2) INFORMATION FOR SEQ ID NO : 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 
GTGAGTCTCC 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10 
AATGTTATAG 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11 
GGTCGAGGGA 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



Cxi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
GGCATTGGAG 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : double 

( D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: cDNA 



Cxi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
GTGAGGAGCA 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
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( D) TOPOLOGY : linear 
(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14 
TCCCTTATAG 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15 
AGAAGCCCAC 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
CCTAGACATG 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
GT GAG TACT A 
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(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18 
TTCCCACCAG 

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

( D ) TOPOLOGY : linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19 
GAATACACCA 

(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 
ACACAATTAG 

(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21 
GTATGTCAAG 

(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
CD) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22 
TCCCTTCCAG 

(2) INFORMATION FOR SEQ ID NO:23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

( D ) TOPOLOGY : linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23 
GAT G AC CAT T 

(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 
TTCTCTAGCT 

(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: cDNA 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25 
GTGAGTACCT 

(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26 
TCCTTTCCAG 

(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 
TTTCCTATCC 

(2) INFORMATION FOR SEQ ID NO:28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 8.: 
ACCCCAGTTG 
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(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:29: 
GTAAGCGTGC 

(2) INFORMATION FOR SEQ ID NO:30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

( D ) TOPOLOGY : linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 
GTCTTTTCAG 

(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 
GTGACTTCAT 

(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32 
ACCTCTCTAG 

(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33 
GTAAGAGGAG 

(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

( D ) TOPOLOGY : linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 
GCTCTTGCAG 

(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 
GGATCACCTC 

(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
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(D) TOPOLOGY; linear 
(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36 
ACTCCGCCAT 

(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37 
GTATGAGCTG 

(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

( D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:38: 
TCCATTTTAG 

(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 
CCTCGTATCA 
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(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 
ATATATCAGC 1Q 
(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3154 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 41. .1558 

(ix) FEATURE: 

(A) NAME/KEY: sig__peptide 

(B) LOCATION: 41. . 94 

(ix) FEATURE: 

(A) NAME/KEY: mat_peptide 

(B) LOCATION: 95. .1558 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 

CGCGACCTCC GCGCAGGTGG TCGCGCCGGT CTCCGCGGAA ATG TTG TCC AAA GTT 55 

Met Leu Ser Lys Val 
-18 -15 

CTT CCA GTC CTC CTA GGC ATC TTA TTG ATC CTC CAG TCG AGG GTC GAG 103 
Leu Pro Val Leu Leu Gly He Leu Leu He Leu Gin Ser Arq Val Glu 
-10 -5 1 

GGA CCT CAG ACT GAA TCA AAG AAT GAA GCC TCT TCC CGT GAT GTT GTC 151 
Gly Pro Gin Thr Glu Ser Lys Asn Glu Ala Ser Ser Arg Asp Val Val 
5 10 15 

TAT GGC CCC CAG CCC CAG CCT CTG GAA AAT CAG CTC CTC TCT GAG GAA 199 
Tyr Gly Pro Gin Pro Gin Pro Leu Glu Asn Gin Leu Leu Ser Glu Glu 
20 25 30 35 

ACA AAG TCA ACT GAG ACT GAG ACT GGG AGC AGA GTT GGC AAA CTG CCA 247 
Thr Lys Ser Thr Glu Thr Glu Thr Gly Ser Arg Val Gly Lys Leu Pro 
40 45 50 
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GAA GCC TCT CGC ATC CTG AAC ACT ATC CTG AGT AAT TAT GAC CAC AAA 2 95 

Glu Ala Ser Arg lie Leu Asn Thr lie Leu Ser Asn Tyr Asp His Lys 
55 60 65 

CTG CGC CCT GGC ATT GGA GAG AAG CCC ACT GTG GTC ACT GTT GAG ATC 34 3 

Leu Arg Pro Gly lie Gly Glu Lys Pro Thr Val Val Thr Val Glu lie 
70 75 80 

GCC GTC AAC AGC CTT GGT CCT CTC TCT ATC CTA GAC ATG GAA TAC ACC 391 
Ala Val Asn Ser Leu Gly Pro Leu Ser lie Leu Asp Met Glu Tyr Thr 
85 90 95 

ATT GAC ATC ATC TTC TCC CAG ACC TGG TAC GAC GAA CGC CTC TGT TAC 4 39 

lie Asp lie lie Phe Ser Gin Thr Trp Tyr Asp Glu Arg Leu Cys Tyr 
100 105 110 115 

AAC GAC ACC TTT GAG TCT CTT GTT CTG AAT GGC AAT GTG GTG AGC CAG 487 
Asn Asp Thr Phe Glu Ser Leu Val Leu Asn Gly Asn Val Val Ser Gin 
120 125 130 

CTA TGG ATC CCG GAC ACC TTT TTT AGG AAT TCT AAG AGG ACC CAC GAG 535 
Leu Trp lie Pro Asp Thr Phe Phe Arg Asn Ser Lys Arg Thr His Glu 
135 140 145 

CAT GAG ATC ACC ATG CCC AAC CAG ATG GTC CGC ATC TAC AAG GAT GGC 5 83 

His Glu lie Thr Met Pro Asn Gin Met Val Arg He Tyr Lys Asp Gly 
150 155 160 

AAG GTG TTG TAC ACA ATT AGG ATG ACC ATT GAT GCC GGA TGC TCA CTC 631 
Lys Val Leu Tyr Thr He Arg Met Thr He Asp Ala Gly Cys Ser Leu 
165 170 175 

CAC ATG CTC AGA TTT CCA ATG GAT TCT CAC TCT TGC CCT CTA TCT TTC 67 9 

His Met Leu Arg Phe Pro Met Asp Ser His Ser Cys Pro Leu Ser Phe 
180 185 190 195 

TCT AGC TTT TCC TAT CCT GAG AAT GAG ATG ATC TAC AAG TGG GAA AAT 727 
Ser Ser Phe Ser Tyr Pro Glu Asn Glu Met He Tyr Lys Trp Glu Asn 
200 205 210 

TTC AAG CTT GAA ATC AAT GAG AAG AAC TCC TGG AAG CTC TTC CAG TTT 775 
Phe Lys Leu Glu He Asn Glu Lys Asn Ser Trp Lys Leu Phe Gin Phe 
215 220 225 

GAT TTT ACA GGA GTG AGC AAC AAA ACT GAA ATA ATC ACA ACC CCA GTT 823 
Asp Phe Thr Gly Val Ser Asn Lys Thr Glu He He Thr Thr Pro Val 
230 235 240 

GGT GAC TTC ATG GTC ATG ACG ATT TTC TTC AAT GTG AGC AGG CGG TTT 871 
Gly Asp Phe Met Val Met Thr He Phe Phe Asn Val Ser Arq Arq Phe 
245 250 255 

GGC TAT GTT GCC TTT CAA AAC TAT GTC CCT TCT TCC GTG ACC ACG ATG 919 
Gly Tyr Val Ala Phe Gin Asn Tyr Val Pro Ser Ser Val Thr Thr Met 
260 265 270 275 

CTC TCC TGG GTT TCC TTT TGG ATC AAG ACA GAG TCT GCT CCA GCC CGG 967 
Leu Ser Trp Val Ser Phe Trp He Lys Thr Glu Ser Ala Pro Ala Arg 
280 285 290 

ACC TCT CTA GGG ATC ACC TCT GTT CTG ACC ATG ACC ACG TTG GGC ACC 1015 
Thr Ser Leu Gly He Thr Ser Val Leu Thr Met Thr Thr Leu Gly Thr 
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295 300 305 

TTT TCT CGT AAG AAT TTC CCG CGT GTC TCC TAT ATC ACA GCC TTG GAT 1063 
Phe Ser Arg Lys Asn Phe Pro Arg Val Ser Tyr He Thr Ala Leu Asp 
310 315 320 

TTC TAT ATC GCC ATC TGC TTC GTC TTC TGC TTC TGC GCT CTG TTG GAG 1111 
Phe Tyr He Ala He Cys Phe Val Phe Cys Phe Cys Ala Leu Leu Glu 
325 330 335 

TTT GCT GTG CTC AAC TTC CTG ATC TAC AAC CAG ACA AAA GCC CAT GCT 115 9 

Phe Ala Val Leu Asn Phe Leu He Tyr Asn Gin Thr Lys Ala His Ala 
340 345 350 355 

TCT CCT AAA CTC CGC CAT CCT CGT ATC AAT AGC CGT GCC CAT GCC CGT 12 07 

Ser Pro Lys Leu Arg His Pro Arg He Asn Ser Arg Ala His Ala Arg 
360 365 370 

ACC CGT GCA CGT TCC CGA GCC TGT GCC CGC CAA CAT CAG GAA GCT TTT 1255 
Thr Arg Ala Arg Ser Arg Ala Cys Ala Arg Gin His Gin Glu Ala Phe 
375 380 385 

GTG TGC CAG ATT GTC ACC ACT GAG GGA AGT GAT GGA GAG GAG CGC CCG 1303 
Val Cys Gin He Val Thr Thr Glu Gly Ser Asp Gly Glu Glu Arg Pro 
390 395 400 

TCT TGC TCA GCC CAG CAG CCC CCT AGC CCA GGT AGC CCT GAG GGT CCC 1351 

Ser Cys Ser Ala Gin Gin Pro Pro Ser Pro Gly Ser Pro Glu Gly Pro 

W 405 410 415 

s " CGC AGC CTC TGC TCC AAG CTG GCC TGC TGT GAG TGG TGC AAG CGT TTT 13 99 

P1 Arg Ser Leu Cys Ser Lys Leu Ala Cys Cys Glu Trp Cys Lys Arg Phe 

5n 420 425 430 435 

f|J AAG AAG TAC TTC TGC ATG GTC CCC GAT TGT GAG GGC AGT ACC TGG CAG 14 4 7 

W L V S L Y S T Y r ph ^ Cys Met Val Pro Asp Cys Glu Gly Ser Thr Trp Gin 

%fj 440 445 450 

CAG GGC CGC CTC TGC ATC CAT GTC TAC CGC CTG GAT AAC TAC TCG AGA 14 95 

Gin Gly Arg Leu Cys He His Val Tyr Arg Leu Asp Asn Tyr Ser Arg 
455 460 465 

GTT GTT TTC CCA GTG ACT TTC TTC TTC TTC AAT GTG CTC TAC TGG CTT 1543 
Val Val Phe Pro Val Thr Phe Phe Phe Phe Asn Val Leu Tyr Trp Leu 
470 475 480 

GTT TGC CTT AAC TTG TAG GTACCAGCTG GTACCCTGTG GGGCAACCTC 15 91 
Val Cys Leu Asn Leu * 
485 

TCCAGTTCCC CAGGAGGTCC AAGCCCCTTG CCAAGGGAGT TGGGGGAAAG C AGC AG CAG C 1651 

AGCAGGAGCG ACTAGAGTTT TTCCTGCCCC ATTCCCCAAA CAG AAG C T T G CAGAGGGTTT 1711 

GTCTTTGCTG CCCCTCTCCC CTACCTGGCC CATTCACTGA GTCTTCTCAG CAGACCATTT 1771 

CAAATTATTA ATAAATGGGC CACCTCCCTC TTCTTCAAGG AGCATCCGTG ATGCTCAGTG 1831 

TTCAAAACCA CAGCCACTTA GTGATCAGCT CCCTAAAACC ATGCCTAAGT ACAGGCGGAT 18 91 

TAGCTATCTT CCAACAATGC TGACCACCAG AC AAT TAC TG CATTTTTCCA GAAGCCCACT 1951 
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ATTGCCTTTG TAGTGCTTTC GGCCCAGTTC TGGCCTCAGC CTCAAAGTGC ACCGACTAGT 2 011 

TGCTTGCCTA TACCTGGCAC CTCATTAAGA TGCTGGGCAG CAGTATAACA G GAG GAAGAG 2071 

ATCCCTCTCC TTTGGTCAGA TTATTATGTT CTCAGTTCTC TCTCCCTGCT ACCCCTTTCT 2131 

CTGCAGATAG ATAGACACTG GCATTATCCC TTOAGGAAGA GGGGGGGGCA GCAAGAGAGC 2191 

CTATTTGGGA CAGCATTCCT CTCTCTCTGC TGCTGTGACA TCTCCCTCTC CTTGCTGGCT 2251 

CCATCTTTCG TCTGCACTAC CAATTCAATG CCCTTCATCC AATGGGTATC TATTTTTGTG 2311 

TGT GAT TATA GTAACTACTC CCTGCTTTAT ATGCCACCCT CTTCCTTCTC TTTGACCCCT 2371 

GTGACTCTTT CTGTAACTTT CCCAGTGACT TCCCCTAGCC CTGACCCAGG CACTAGGCCT 24 31 

TGGTGACTTC CTGGGGCCAA GAAAC T AAGG AAACTCGGCT TTGCAACAGG CATTACTCGC 24 91 

CATTGATTGG TGCCCACCCA GGGCACACTG TCGGAGTTCT ATCACTTGCT TGACCCCTGG 2551 

ACCCATAAAC CAGTCCACTG TTATACCCGG GGCACTCTAA C CAT CAC AAT CAATCAATCA 2611 

AATTCCCTTA AATTTGTATG GCACTGGAAC TTTGGCAAAG CACTTTTGAC AAGTTGTGTC 2 671 

TGATTGGAGC T T CAT GAT AG CCTTGTGACA TCTTTAGGGC AGGATTCTTA TCCCCATTTT 2731 

GCAGAT GAAA ACCCTGAGTC ACAGATTTCT GTGGGACTGT GGATCTCACT GGAAGCTATC 27 91 

CAAGAGCCCA CTGTCACCTT CTAGACCACA TGATAGGGCT AGACAGCTCA GTTCACCATG 2 851 

ATTCTCTTCT GTCACCTCTG CTGGCACACC AGTGGCAAGG CCCAGAATGG CGACCTCTCT 2 911 

TTAGCTCAAT TTCTGGGCCT GAGGTGCTCA GACTGCCCCC AAGATCAAAT CTCTCCTGGC 2 971 

TGTAGTAACC CAGTGGAATG AATTTGGACA TGCCCCAATG CTTCTATATG CTAAGTGAAA 3031 

TCTGTGTCTG TAATTTGTTG GGGGGTGGAT AGGGTGGGGT CTCCATCTAC TTTTTGTCAC 3091 

CATCATCTGA AATGGGGAAA TATGTAAATA AAT AT AT C AG CAAAGCAAAA AGAAAAAAAA 3151 
AAA 



(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 506 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 

Met Leu Ser Lys Val Leu Pro Val Leu Leu Gly He Leu Leu He Leu 
"18 "15 -10 -5 

Gin Ser Arg Val Glu Gly Pro Gin Thr Glu Ser Lys Asn Glu Ala Ser 
15 10 

Ser Arg Asp Val Val Tyr Gly Pro Gin Pro Gin Pro Leu Glu Asn Gin 
15 20 25 30 
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Leu Leu Ser Glu 



Val Gly Lys Leu 
50 



Asn Tyr Asp His 
65 



Val Thr Val Glu 

80 

Asp Met Glu Tyr 
95 

Glu Arg Leu Cys 



Asn Val Val Ser 
130 

Lys Arg Thr His 
145 



lie Tyr Lys Asp 
160 



Ala Gly Cys Ser 
175 



Cys Pro Leu Ser 



Tyr Lys Trp Glu 
210 



Lys Leu Phe Gin 
225 

lie Thr Thr Pro 
240 



Val Ser Arg Arg 
255 



Ser Val Thr Thr 



Ser Ala Pro Ala 
290 

Thr Thr Leu Gly 
305 

lie Thr Ala Leu 
320 



Cys Ala Leu Leu 
335 

Thr Lys Ala His 



Glu Thr Lys Ser 
35 

Pro Glu Ala Ser 



Lys Leu Arg Pro 
70 

lie Ala Val Asn 
85 



Thr lie Asp lie 
100 



Tyr Asn Asp Thr 
115 



Gin Leu Trp lie 



Glu His Glu lie 
150 



Gly Lys Val Leu 
165 

Leu His Met Leu 
180 



Phe Ser Ser Phe 
195 



Asn Phe Lys Leu 



Phe Asp Phe Thr 
230 

Val Gly Asp Phe 
245 

Phe Gly Tyr Val 
260 



Met Leu Ser Trp 
275 



Arg Thr Ser Leu 



Thr Phe Ser Arg 
310 

Asp Phe Tyr lie 
325 

Glu Phe Ala Val 
340 

Ala Ser Pro Lys 
355 



Thr Glu Thr Glu 
40 



Arg lie Leu Asn 
55 



Gly He Gly Glu 



Ser Leu Gly Pro 
90 



He Phe Ser Gin 
105 



Phe Glu Ser Leu 
120 

Pro Asp Thr Phe 
135 

Thr Met Pro Asn 



Tyr Thr He Arg 
170 



Arg Phe Pro Met 
185 



Ser Tyr Pro Glu 
200 

Glu He Asn Glu 
215 



Gly Val Ser Asn 



Met Val Met Thr 
250 

Ala Phe Gin Asn 
265 

Val Ser Phe Trp 
280 



Gly He Thr Ser 
295 



Lys Asn Phe Pro 



Ala He Cys Phe 
330 

Leu Asn Phe Leu 
345 



Leu Arg His Pro 
360 



Thr Gly Ser Arg 
45 



Thr He Leu Ser 
60 

Lys Pro Thr Val 
75 

Leu Ser He Leu 



Thr Trp Tyr Asp 
110 

Val Leu Asn Gly 
125 



Phe Arg Asn Ser 
140 



Gin Met Val Arg 
155 



Met Thr He Asp 



Asp Ser His Ser 
190 



Asn Glu Met He 
205 

Lys Asn Ser Trp 
220 



Lys Thr Glu He 
235 



He Phe Phe Asn 



Tyr Val Pro Ser 
270 

He Lys Thr Glu 
285 

Val Leu Thr Met 
300 



Arg Val Ser Tyr 
315 



Val Phe Cys Phe 



He Tyr Asn Gin 
350 

Arg He Asn Ser 
365 
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Arg Ala His Ala Arg Thr Arg Ala Arg Ser Arg Ala Cys Ala Arg Gin 
370 375 380 

His Gin Glu Ala Phe Val Cys Gin He Val Thr Thr Glu Gly Ser Asp 
385 390 395 

Gly Glu Glu Arg Pro Ser Cys Ser Ala Gin Gin Pro Pro Ser Pro Gly 
400 405 410 

Ser Pro Glu Gly Pro Arg Ser Leu Cys Ser Lys Leu Ala Cys Cys Glu 
415 420 425 430 

Trp Cys Lys Arg Phe Lys Lys Tyr Phe Cys Met Val Pro Asp Cys Glu 
435 440 445 

Gly Ser Thr Trp Gin Gin Gly Arg Leu Cys He His Val Tyr Arg Leu 
450 455 460 

Asp Asn Tyr Ser Arg Val Val Phe Pro Val Thr Phe Phe Phe Phe Asn 
465 470 475 

fi Val Leu Tyr Trp Leu Val Cys Leu Asn Leu 

7*\ 480 485 

7^1 (2) INFORMATION FOR SEQ ID NO: 43: 

^ (i) SEQUENCE CHARACTERISTICS: 

ttj (A) LENGTH: 303 base pairs 

hi (B) TYPE: nucleic acid 

ft! (C) STRANDEDNESS: both 

(D) TOPOLOGY: both 

!r: (ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:43: 

TAGGTACCAG CTGGTACCCT GTGGGGCAAC CTCTCCAGTT CCCCAGGAGG TCCAAGCCCC 60 

TTGCCAAGGG AGTTGGGGGA AAGCAGCAGC AGCAGCAGGA GCGACTAGAG TTTTTCCTGC 12 0 

CCCATTCCCC AAACAGAAGC TTGCAGAGGG TTTGTTTTTG CTGCCCCTCT CCCCTACCTG 18 0 

GCCCATTCAC TGAGTCTTCT CAGCAGACCA TTTCAAATTA TTAATAAATG GGCCACCTCC 24 0 

CTCTTCTTCA AG GAG CAT C C GTGATGCTCA GTGTTCAAAA CCACAGCCAC TTAGTGATCA 300 
GCT 

(2) INFORMATION FOR SEQ ID NO : 4 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 206 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 



303 



-134- 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 

GCTTTGCTGA TATATTTATT TACATATTTC CCCATTTCAG ATGATGGTGA CAAAAAGTAG 60 

ATGGAGACCC CACCCTATCC ACCCCCCAAC AAATTACAGA CACAGATTTC ACTTAGCATA 120 

TAGAAGCATT GGGGCATGTC CAAATTCATT CCACTGGGTT ACTACAGCCA GGAGAGATTT 18 0 

GATCTTGGGG GCAGTCTGGA GCACCT 206 

(2) INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3974 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 

off 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 

0$ GGTACCTAAG TGAGTAGGGC GTCCGATCGA CGGACGCCTT TTTTTTGAAT TCGTAATCAT 60 

||1 GGTCATAGCT GTTTCCTGTG TGAAATTGTT ATCCGCTCAC AATTCCACAC AACATACGAG 120 

^ CCGGAAGCAT AAAGTGTAAA GCCTGGGGTG CCTAATGAGT GAGCTAACTC ACATTAATTG 18 0 

IU CGTTGCGCTC ACTGCCCGCT TTCCAGTCGG GAAACCTGTC GTGCCAGCTG CAT TAAT GAA 24 0 

gl TCGGCCAACG CGCGGGGAGA GGCGGTTTGC GTATTGGGCG CTCTTCCGCT TCCTCGCTCA 300 

g CTGACTCGCT GCGCTCGGTC GTTCGGCTGC GGCGAGCGGT ATCAGCTCAC TCAAAGGCGG 360 

TAATACGGTT AT CC ACAGAA TCAGGGGATA ACGCAGGAAA GAACAT GT GA GCAAAAGGCC 420 

AGCAAAAGGC CAGGAACCGT AAAAAGGCCG CGTTGCTGGC GTTTTTCCAT AGGCTCCGCC 4 80 

CCCCTGACGA GCAT CACAAA AATCGACGCT CAAGTCAGAG GTGGCGAAAC CCGACAGGAC 54 0 

TATAAAGATA CCAGGCGTTT CCCCCTGGAA GCTCCCTCGT GCGCTCTCCT GTTCCGACCC 600 

TGCCGCTTAC CGGATACCTG TCCGCCTTTC TCCCTTCGGG AAGCGTGGCG CTTTCTCATA 660 

GCTCACGCTG TAGGTATCTC AGTTCGGTGT AGGTCGTTCG CTCCAAGCTG GGCTGTGTGC 72 0 

ACGAACCCCC CGTTCAGCCC GACCGCTGCG CCTTATCCGG TAACTATCGT CTTGAGTCCA 780 

ACCCGGTAAG ACACGACTTA TCGCCACTGG CAGCAGCCAC TGGTAACAGG ATTAGCAGAG 84 0 

CGAGGTATGT AGGCGGTGCT ACAGAGTTCT TGAAGTGGTG GCCTAACTAC GGCTACACTA 900 

GAAGAACAGT ATTTGGTATC TGCGCTCTGC TGAAGCCAGT TACCTTCGGA AAAAGAGTTG 960 

GTAGCTCTTG AT CCGGCAAA CAAACCACCG CTGGTAGCGG TGGTTTTTTT GTTTGCAAGC 1020 

AGCAGAT TAC GCGCAGAAAA AAAGGATCTC AAGAAG AT C C TTTGATCTTT TCTACGGGGT 108 0 
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CTGACGCTCA 


GTGGAACGAA 


AACTCACGTT 


AAGGGATTTT 


GGT CAT GAGA 


TTATCGTCGA 


1140 


CAATTCGCGC 


GCGAAGGCGA 


AGCGGCATGC 


ATTTACGTTG 


AC AC CAT C G A 


ATGGTGCAAA 


1200 


ACCTTTCGCG 


GTATGGCATG 


ATAGCGCCCG 


GAAGAGAGTC 


AATTCAGGGT 


GGTGAATGTG 


1260 


AAACCAGTAA 


CGTTATACGA 


TGTCGCAGAG 


TATGCCGGTG 


TCTCTTATCA 


GACCGTTTCC 


1320 


CGCGTGGTGA 


ACCAGGCCAG 


CCACGTTTCT 


GCGAAAACGC 


GGGAAAAAGT 


GGAAGCGGCG 


1380 


ATGGCGGAGC 


TGAATTACAT 


TCCCAACCGC 


GTGGCACAAC 


AACTGGCGGG 


CAAACAGTCG 


1440 


TTGCTGATTG 


GCGTTGCCAC 


CTCCAGTCTG 


GCCCTGCACG 


CGCCGTCGCA 


AATTGTCGCG 


1500 


GCGATTAAAT 


CTCGCGCCGA 


TCAACTGGGT 


GCCAGCGTGG 


TGGTGTCGAT 


G G TAG AAC GA 


1560 


AGCGGCGTCG 


AAGCCTGTAA 


AGCGGCGGTG 


CACAATCTTC 


TCGCGCAACG 


CGTCAGTGGG 


1620 


CTGATCATTA 


ACTATCCGCT 


GGATGACCAG 


GAT GC CAT TG 


CTGTGGAAGC 


TGCCTGCACT 


1680 


AATGTTCCGG 


CGTTATTTCT 


TGATGTCTCT 


GACCAGACAC 


C CAT C AAC AG 


TATTATTTTC 


1740 


TCCCATGAAG 


ACGGTACGCG 


ACTGGGCGTG 


GAGCATCTGG 


TCGCATTGGG 


T C AC C AG C AA 


1800 


ATCGCGCTGT 


TAGCGGGCCC 


ATTAAGTTCT 


GTCTCGGCGC 


GTCTGCGTCT 


GGCTGGCTGG 


18 60 


CATAAATATC 


TCACTCGCAA 


TCAAATTCAG 


CCGATAGCGG 


AACGGGAAGG 


CGACTGGAGT 


1920 


GCCATGTCCG 


GTTTTCAACA 


AACCATGCAA 


ATGCTGAATG 


AGGGCATCGT 


TCCCACTGCG 


1980 


ATGCTGGTTG 


CCAACGATCA 


GATGGCGCTG 


GGCGCAATGC 


GCGCCATTAC 


CGAGTCCGGG 


2040 


CTGCGCGTTG 


GTGCGGATAT 


CTCGGTAGTG 


GGATACGACG 


ATACCGAAGA 


CAGCTCATGT 


2100 


TATATCCCGC 


CGTTAACCAC 


CAT CAAACAG 


GATTTTCGCC 


TGCTGGGGCA 


AACCAGCGTG 


2160 


GACCGCTTGC 


TGCAACTCTC 


TCAGGGCCAG 


GCGGTGAAGG 


GCAATCAGCT 


GTTGCCCGTC 


2220 


TCACTGGTGA 


AAAGAAAAAC 


CACCCTGGCG 


CCCAATACGC 


AAACCGCCTC 


TCCCCGCGCG 


2280 


TTGGCCGATT 


CATTAATGCA 


GCTGGCACGA 


CAGGTTTCCC 


GACTGGAAAG 


CGGGCAGTGA 


2340 


GCGCAACGCA 


ATTAATGTAA 


GTTAGCGCGA 


ATTGTCGACC 


AAAGCGGCCA 


TCGTGCCTCC 


24 00 


CCACTCCTGC 


AGTTCGGGGG 


CATGGATGCG 


CGGATAGCCG 


CTGCTGGTTT 


CCTGGATGCC 


24 60 


GACGGATTTG 


CACTGCCGGT 


AGAACTCCGC 


GAGGTCGTCC 


AGCCTCAGGC 


AGCAGCTGAA 


2520 


CCAACTCGCG 


AGGGGATCGA 


GCCCGGGGTG 


GGCGAAGAAC 


TCCAGCATGA 


GATCCCfGCG 


i. o u 


CTGGAGGATC 


ATCCAGCCGG 


CGTCCCGGAA 


AACGATTCCG 


AAGCCCAACC 


TTTCATAGAA 


2 64 0 




AA I C G AAAT 


CTCGTGATGG 


CAGGTTGGGC 


GTCGCTTGGT 


CGGTCATTTC 


2700 


GAACCCCAGA 


GTCCCGCTCA 


GAAGAACTCG 


TCAAGAAGGC 


GATAGAAGGC 


GATGCGCTGC 


2760 


GAATCGGGAG 


CGGCGATACC 


GTAAAGCACG 


AGGAAGCGGT 


CAGCCCATTC 


GCCGCCAAGC 


2820 


TCTTCAGCAA 


TATCACGGGT 


AGCCAACGCT 


ATGTCCTGAT 


AGCGGTCCGC 


CACACCCAGC 


2880 


CGGCCACAGT 


CGATGAATCC 


AGAAAAGCGG 


CCATTTTCCA 


CC AT GAT ATT 


CGGCAAGCAG 


2940 



-136- 





GCATCGCCAT 


GGGTCACGAC 


GAGATCCTCG 


CCGTCGGGCA 


TGCGCGCCTT 


GAGCCTGGCG 


3000 




AACAGTTCGG 


CTGGCGCGAG 


CCCCTGATGC 


TCTTCGTCCA 


GATCATCCTG 


ATCGACAAGA 


3060 




CCGGCTTCCA 


TCCGAGTACG 


TGCTCGCTCG 


ATGCGATGTT 


TCGCTTGGTG 


GTCGAATGGG 


3120 




CAGGTAGCCG 


GATCAAGCGT 


ATGCAGCCGC 


CGCATTGCAT 


CAGCCATGAT 


GGATACTTTC 


3180 




TCGGCAGGAG 


CAAGGTGAGA 


TGACAGGAGA 


TCCTGCCCCG 


GCACTTCGCC 


CAATAGCAGC 


3240 




CAGTCCCTTC 


CCGCTTCAGT 


GACAACGTCG 


AGCACAGCTG 


CGCAAGGAAC 


GCCCGTCGTG 


3300 




GCCAGCCACG 


ATAGCCGCGC 


TGCCTCGTCC 


TGCAGTTCAT 


TCAGGGCACC 


GGACAGGTCG 


3360 




GTCTTGACAA 


AAAGAACCGG 


GCGCCCCTGC 


GCTGACAGCC 


GGAACACGGC 


GGCATCAGAG 


3420 




CAGCCGATTG 


TCTGTTGTGC 


CCAGTCATAG 


CCGAATAGCC 


TCTCCACCCA 


AGCGGCCGGA 


3480 




GAACCTGCGT 


GCAATCCATC 


TTGTTCAATC 


ATGCGAAACG 


ATCCTCATCC 


TGTCTCTTGA 


3540 




TCAGATCTTG 


ATCCCCTGCG 


CCATCAGATC 


CTTGGCGGCA 


AGAAAGCCAT 


CCAGTTTACT 


3600 




TTGCAGGGCT 


TCCCAACCTT 


ACCAGAGGGC 


GCCCCAGCTG 


GCAATTCCGG 


TTCGCTTGCT 


3660 


;l t 


GTCCATAAAA 


CCGCCCAGTC 


TAGCTATCGC 


CATGTAAGCC 


CACTGCAAGC 


TACCTGCTTT 


3720 


93 


CTCTTTGCGC 


TTGCGTTTTC 


CCTTGTCCAG 


ATAGCCCAGT 


AGCTGACATT 


CATCCGGGGT 


37 8 0 


S! 


CAGCACCGTT 


TCTGCGGACT 


GGCTTTCTAC 


GTGTTCCGCT 


TCCTTTAGCA 


GCCCTTGCGC 


3840 




CCTGAGTGCT 


TGCGGCAGCG 


TGAAGCTTAA 


AAAACTGCAA 


AAAATAGTTT 


GACTTGTGAG 


3900 


fi 

S% \ 


CGGATAACAA 


TTAAGATGTA 


CCCAATTGTG 


AGCGGATAAC 


AATTTCACAC 


ATTAAAGAGG 


3960 


was i 


AGAAAT T AC A 


TATG 










397 4 




(2) INFORMATION FOR SEQ ID NO : 4 6 : 












(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 112 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: both 










(ii) MOLECULE TYPE 


; : cDNA 












(xi). SEQUENCE DESCRIPTION: SEQ ID NO : 4 6 : 










AAGCTTAAAA 


AACTGCAAAA 


AATAGTTTGA 


CTTGTGAGCG 


GATAACAATT 


AAGATGTACC 


60 




CAATTGTGAG 


CGGATAACAA 


TTTCACACAT 


TAAAGAGGAG 


AAAT TACAT A 


TG 


112 



What Is Claimed Is: 
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L An isolated nucleic acid molecule comprising a polynucleotide 
havirfg a nucleotide sequence at least 95% identical to a sequence selected from 
the group consisting of: 

(a) a nucleotide sequence encoding a polypeptide comprising 
amino acids from about 1 to about 242 in SEQ ID NO:2; 

(b) a nucleotide sequence encoding a polypeptide comprising 
amino acids from about 2 to about 242 in SEQ ID NO:2; 

(c) a nucleotide sequence encoding a polypeptide comprising 
amino acids from about 15 to about 36 in SEQ ID NO:2; 

(d) a nucleotide sequence encoding a polypeptide comprising 
amino acids from about 42 to about 62 in SEQ ID NO:2; 

(e) a nucleotide sequence encoding a polypeptide comprising 
amino acids from about 75 to about 95 in SEQ ID NO:2; 

(f) a nucleotide sequence encoding a polypeptide comprising 
amino acids from about 219 to about 240 in SEQ ID NO:2; 

(g) a nucleotide sequence encoding a polypeptide comprising 
amino acids from about 96 to about 218 in SEQ ID NO:2; 

(h) a nucleotide sequence encoding the ET2 receptor subunit 
with all or part of one or more of the transmembrane domains deleted; and 

(i) a nucleotide sequence complementary to any of the 
nucleotide sequences in (a), (b), (c), (d), (e), (f), (g), or (h). 

2. An isolated nucleic acid molecule comprising a polynucleotide 
which hybridizes under stringent hybridization conditions to a polynucleotide 
having a nucleotide sequence identical to a nucleotide sequence in (a), (b), (c), (d), 
( e )> (f)> (g)» (h), or (i) of claim 1 wherein said polynucleotide which hybridizes 
does not hybridize under stringent hybridization conditions to a polynucleotide 
having a nucleotide sequence consisting of only A residues or of only T residues. 
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3y An isolated nucleic acid molecule comprising a polynucleotide 
having a sequence at least 95% identical to a sequence selected from the group 
consisting of: 

(a) the nucleotide sequence of a fragment of the sequence 
shown in SEQ ID NO: 1 , wherein said fragment comprises at least 20 contiguous 
nucleotides of SEQ ID NO:l 3 provided that said nucleotide sequence is not 
HUKAU66R (SEQ ID NO:43) and HPLBB96F (SEQ ID NO:44), or any 
subfragment thereof; and 

(b) a nucleotide sequence complementary to a nucleotide 
sequence in (a). 

4. The nucleotide sequence of a fragment of claim 3, wherein said 
fragment comprises at least 50 contiguous nucleotides of SEQ ID NO:l. 

5. A method for making a recombinant vector comprising inserting 
an isolated nucleic acid molecule of claim 1 into a vector. 

6. A recombinant vector produced by the method of claim 5. 

7. A method of making a recombinant host cell comprising 
introducing the recombinant vector of claim 6 into a host cell. 

8. A recombinant host cell produced by the method of claim 7. 

9. A recombinant method for producing a polypeptide, comprising 
culturing the recombinant host cell of claim 8 under conditions such that said 
polypeptide is expressed and recovering said polypeptide. 

10. / An isolated polypeptide having an amino acid sequence at least 

/ • 

95% identical to a sequence selected from the group consisting of: 
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(a) amino acids from about 1 to about 242 in SEQ ID NO:2; 

(b) amino acids from about 2 to about 242 in SEQ ID NO:2; 

(c) amino acids from about 15 to about 36 in SEQ ID NO:2; 

(d) amino acids from about 42 to about 62 in SEQ ID NO:2; 

(e) amino acids from about 75 to about 95 in SEQ ID NO:2; 

(f) amino acids from about 2 1 9 to about 240 in SEQ ID NO :2; 

(g) amino acids from about 96 to about 21 8 in SEQ ID NO:2; 

(h) the amino acid sequence of the ET2 polypeptide with all or 
part of one or more of the transmembrane domains deleted; and 

(i) the amino acid sequence of an epitope-bearing portion of 
any one of the polypeptides of (a), (b), (c), (d), (e), (f), (g), or (h). 

11. An isolated antibody that binds specifically to a polypeptide of 
claim 10. 



12^^An isolated nucleic acid molecule comprising a polynucleotide 
encoding an ET2 polypeptide wherein, except for at least one conservative amino 
acid substitution, said polypeptide has a sequence selected from the group 
consisting of: 

(a) a nucleotide sequence encoding a polypeptide comprising 
amino acids from about 1 to about 242 in SEQ ID NO:2; 

(b) a nucleotide sequence encoding a polypeptide comprising 
amino acids from about 2 to about 242 in SEQ ID NO:2; 

(c) a nucleotide sequence encoding a polypeptide comprising 
amino acids from about 15 to about 36 in SEQ ID NO:2; 

(d) a nucleotide sequence encoding a polypeptide comprising 
amino acids from about 42 to about 62 in SEQ ID NO:2; 

(e) a nucleotide sequence encoding a polypeptide comprising 
amino acids from about 75 to about 95 in SEQ ID NO:2; 
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(f) a nucleotide sequence encoding a polypeptide comprising 
amino acids from about 219 to about 240 in SEQ ID NO:2; 

(g) a nucleotide sequence encoding a polypeptide comprising 
amino acids from about 96 to about 218 in SEQ ID NO:2; 

(h) a nucleotide sequence encoding the ET2 receptor subunit 
with all or part of one or more of the transmembrane domains deleted; and 

(i) a nucleotide sequence complementary to any of the 
nucleotide sequences in (a), (b), (c), (d), (e), (f), (g) ? or (h). 

13, An isolated polypeptide wherein, except for at least one 
conservative amino acid substitution, said polypeptide has a sequence selected 
from the group consisting of: 

(a) amino acids from about 1 to about 242 in SEQ ID NO:2; 

(b) amino acids from about 2 to about 242 in SEQ ID NO:2; 

(c) amino acids from about 15 to about 36 in SEQ ID NO:2; 

(d) amino acids from about 42 to about 62 in SEQ ID NO:2; 

(e) amino acids from about 75 to about 95 in SEQ ID NO:2; 

(f) amino acids from about 2 1 9 to about 240 in SEQ ID NO:2; 

(g) amino acids from about 96 to about 21 8 in SEQ ID NO:2; 

(h) the amino acid sequence of the ET2 polypeptide with all or 
part of one or more of the transmembrane domains deleted; and 

(i) the amino acid sequence of an epitope-bearing portion of 
any one of the polypeptides of (a), (b), (c), (d), (e), (f), (g), or (h). 

14^ An isolated nucleic acid molecule comprising a polynucleotide 
having a nucleotide sequence at least 95% identical to a sequence selected from 
the group consisting of: 

(a) a nucleotide sequence encoding a polypeptide comprising 
amino acids from about -18 to about 488 in SEQ ID NO:42; 
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(b) a nucleotide sequence encoding a polypeptide comprising 
amino acids from about -17 to about 488 in SEQ ID NO:42; 

(c) a nucleotide sequence encoding a polypeptide comprising 
amino acids from about 1 to about 488 in SEQ ID NO:42; 

(d) a nucleotide sequence encoding a polypeptide comprising 
amino acids from about 1 to about 260 in SEQ ID NO:42; 

(e) a nucleotide sequence encoding the GABRE receptor 
subunit with all or part of one or more of the transmembrane domains deleted; 

(f) a nucleotide sequence encoding a polypeptide having the 
amino acid sequence encoded by the cDNA clone contained in ATCC Deposit No. 



(g) a nucleotide sequence encoding the mature GABRE 
polypeptide having the amino acid sequence encoded by the cDNA clone 
contained in ATCC Deposit No. ; and 

(h) a nucleotide sequence complementary to any of the 
nucleotide sequences in (a), (b), (c), (d), (e), (f), or (g). 

15. An isolated nucleic acid molecule comprising a polynucleotide 
which hybridizes under stringent hybridization conditions to a polynucleotide 
having a nucleotide sequence identical to a nucleotide sequence in (a), (b), (c), (d), 

(f)> (g)> °r (h) of claim 14 wherein said polynucleotide which hybridizes does 
not hybridize under stringent hybridization conditions to a polynucleotide having 
a nucleotide sequence consisting of only A residues or of only T residues. 

16, -^^An isolated nucleic acid molecule comprising a polynucleotide 
having a sequence at least 95% identical to a sequence selected from the group 
consisting of: 

(a) the nucleotide sequence of a fragment of the sequence 
shown in SEQ ID NO:4 1 , wherein said fragment comprises at least 20 contiguous 
nucleotides of SEQ ID NO:41, provided that said nucleotide sequence is not 
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HUKAU66R (SEQ ID NO:43) and HPLBB96F (SEQ ID NO:44), or any 
subfragment thereof; and 

(b) a nucleotide sequence complementary to a nucleotide 
sequence in (a). 

1 7. The nucleotide sequence of a fragment of claim 1 6, wherein said 
fragment comprises at least 50 contiguous nucleotides of SEQ ID NO:41 . 

18. A method for making a recombinant vector comprising inserting 
an isolated nucleic acid molecule of claim 14 into a vector. 

19. A recombinant vector produced by the method of claim 18. 

20. A method of making a recombinant host cell comprising 
introducing the recombinant vector of claim 19 into a host cell. 

21. A recombinant host cell produced by the method of claim 20. 

22. A recombinant method for producing a polypeptide, comprising 
culturing the recombinant host cell of claim 21 under conditions such that said 
polypeptide is expressed and recovering said polypeptide. 

23^ An isolated polypeptide having an amino acid sequence at least 
95%tMentical to a sequence selected from the group consisting of: 



(a) 



amino acids from about -18 to about 488 in SEQ ID 



NO:42; 



(b) 



amino acids from about -17 to about 488 in SEQ ID 



NO:42; 



(c) 
(d) 



amino acids from about 1 to about 488 in SEQ ID NO:42; 
amino acids from about 1 to about 260 in SEQ ID NO:42; 
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(e) the amino acid sequence of the GABRE polypeptide with 
all or part of one or more of the transmembrane domains deleted; and 

(f) the amino acid sequence of an epitope-bearing portion of 
any one of the polypeptides of (a), (b), (c), (d), or (e). 

24. An isolated antibody that binds specifically to a polypeptide of 
claim 23. 

25/"" An isolated nucleic acid molecule comprising a polynucleotide 
encoding a GABRE polypeptide wherein, except for at least one conservative 
amino acid substitution, said polypeptide has a sequence selected from the group 
consisting of: 

(a) a nucleotide sequence encoding a polypeptide comprising 
amino acids from about -18 to about 488 in SEQ ID NO:42; 

(b) a nucleotide sequence encoding a polypeptide comprising 
amino acids from about -17 to about 488 in SEQ ID NO:42; 

(c) a nucleotide sequence encoding a polypeptide comprising 
amino acids from about 1 to about 488 in SEQ ID NO:42; 

(d) a nucleotide sequence encoding a polypeptide comprising 
amino acids from about 1 to about 260 in SEQ ID NO:42; 

(e) a nucleotide sequence encoding the GABRE receptor 
subunit with all or part of one or more of the transmembrane domains deleted; 

(f) a nucleotide sequence encoding a polypeptide having the 
amino acid sequence encoded by the cDNA clone contained in ATCC Deposit No. 



(g) a nucleotide sequence encoding the mature GABRE 
polypeptide having the amino acid sequence encoded by the cDNA clone 
contained in ATCC Deposit No. ; and 

(h) a nucleotide sequence complementary to any of the 
nucleotide sequences in (a), (b), (c), (d), (e), (f), or (g). 
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^26^ An isolated polypeptide wherein, except for at least one 
conservative amino acid substitution, said polypeptide has a sequence selected 
from the group consisting of: 

(a) amino acids from about -18 to about 488 in SEQ ID 

(b) amino acids from about -17 to about 488 in SEQ ID 

(c) amino acids from about 1 to about 488 in SEQ ID NO:42; 

(d) amino acids from about 1 to about 260 in SEQ ID NO:42; 

(e) the amino acid sequence of the GABRE polypeptide with 
all or part of one or more of the transmembrane domains deleted; and 

(f) the amino acid sequence of an epitope-bearing portion of 
any one of the polypeptides of (a), (b), (c), (d), or (e). 



NO:42; 
NO:42; 
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ET2: An Alternative Transcript of the GABA A 
Receptor e Subunit 



Abstract 



The present invention relates to a novel GABA A receptor e subunit 
(GABRE) and an alternative transcript thereof (ET2). More specifically, isolated 
nucleic acid molecules are provided encoding human GABRE and ET2 receptor 
subunits. ET2 and GABRE polypeptides are also provided, as are vectors, host 
cells and recombinant methods for producing the same. The invention further 
relates to screening methods for identifying agonists and antagonists of ET2 and 
GABRE activities. Also provided are diagnostic methods for detecting aberrant 
GABRE and ET2 expression, as well as therapeutic methods for treating disorders 
involving ET2 and GABRE. 
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